Finite Automata and Their Decision Problems


Abstract: Finite automata are considered in this paper as instruments for classifying finite tapes. Each one-tape
automaton defines a set of tapes, a two-tape automaton defines a set of pairs of tapes, et cetera. The
structure of the defined sets is studied. Various generalizations of the notion of an automaton are introduced
and their relation to the classical automata is determined. Some decision problems concerning automata are
shown to be solvable by effective algorithms; others turn out to be unsolvable by algorithms.


Introduction 


Turing machines are widely considered to be the abstract 
prototype of digital computers; workers in the field, how- 
ever, have felt more and more that the notion of a Turing 
machine is too general to serve as an accurate model of 
actual computers. It is well known that even for simple 
calculations it is impossible to give an a priori upper 
bound on the amount of tape a Turing machine will need 
for any given computation. It is precisely this feature that 
renders Turing’s concept unrealistic. 

In the last few years the idea of a finite automaton has 
appeared in the literature. These are machines having 
only a finite number of internal states that can be used 
for memory and computation. The restriction of finite- 
ness appears to give a better approximation to the idea of 
a physical machine. Of course, such machines cannot do 
as much as Turing machines, but the advantage of being 
able to compute an arbitrary general recursive function 
is questionable, since very few of these functions come 
up in practical applications. 

Many equivalent forms of the idea of finite automata 
have been published. One of the first of these was the 
definition of “nerve-nets” given by McCulloch and Pitts. 
The theory of nerve-nets has been developed by authors 
too numerous to mention. We have been particularly influenced,
however, by the work of S. C. Kleene, who
proved an important theorem characterizing the possible 
action of such devices (this is the notion of “regular 
event” in Kleene’s terminology). J. R. Myhill, in some 
unpublished work, has given a new treatment of Kleene’s 
results and this has been the actual point of departure 
for the investigations presented in this report. We have 
not, however, adopted Myhill’s use of directed graphs as 
a method of viewing automata but have retained throughout
a machine-like formalism that permits direct comparison
with Turing machines. A neat form of the definition
of automata has been used by Burks and Wang
and by E. F. Moore, and our point of view is closer to
theirs than it is to the formalism of nerve-nets. However, 
we have adopted an even simpler form of the definition 
by doing away with a complicated output function and 
having our machines simply give “yes” or “no” answers. 
This was also used by Myhill, but our generalizations to
the “nondeterministic,” “two-way,” and “many-tape”
machines seem to be new.

In Sections 1-6 the definition of the one-tape, one-way 
automaton is given and its theory fully developed. These 
machines are considered as “black boxes” having only a 
finite number of internal states and reacting to their en- 
vironment in a deterministic fashion. 

We center our discussions around the application of 
automata as devices for defining sets of tapes by giving 
“yes” or “no” answers to individual tapes fed into them. 
To each automaton there corresponds the set of those 
tapes “accepted” by the automaton; such sets will be re- 
ferred to as definable sets. The structure of these sets of 
tapes, the various operations which we can perform on 
these sets, and the relationships between automata and 
defined sets are the broad topics of this paper. 

After defining and explaining the basic notions we
give, continuing work by Nerode, Myhill, and Shepherdson,
an intrinsic mathematical characterization of
definable sets. This characterization turns out to be a
useful tool both for proving that certain sets are definable
by an automaton and for proving that certain other sets
are not.

In Section 4 we discuss decision problems concerning 
automata. We consider the three problems of deciding 
whether an automaton accepts any tapes, whether it accepts
an infinite number of different tapes, and whether
two automata accept precisely the same tapes. All three 
problems are shown to be solvable by effective algo- 
rithms. 

In Chapter II we consider possible generalizations of 
the notion of an automaton. A nondeterministic automa- 
ton has, at each stage of its operation, several choices of 
possible actions. This versatility enables us to construct 
very powerful automata using only a small number of 
internal states. Nondeterministic automata, however, 
turn out to be equivalent to the usual automata. This fact 
is utilized for showing quickly that certain sets are defina- 
ble by automata. 

Using nondeterministic automata, a previously given 
construction of the direct product of automata (Defini- 
tion 7), and the mathematical characterization of defina- 
ble sets, we give short proofs for various well-known 
closure properties of the class of definable sets (e.g., the 
definable sets form a Boolean algebra). Furthermore we 
include, for the sake of completeness, a formulation of 
Kleene’s theorem about regular events. 

In trying to define automata which are closer to the 
ideal of the Turing machine, while preserving the im- 
portant feature of using only a preassigned amount of 
tape, another generalization suggests itself. We relax the 
condition that the automaton always move in one direc- 
tion and allow the machine to travel back and forth. In 
this way we arrive at the idea of a two-way automaton. 
In Section 7 we consider the problem of comparing one- 
way with two-way automata, a study that can be con- 
strued as an investigation into the nature of memory of 
finite automata. A one-way machine can be imagined as 
having simply a keyboard representing the symbols of 
the alphabet and as having the sequence from the tape 
fed in by successively punching the keys. Thus no permanent
record of the tape is required for the operation of
the machine. A two-way automaton, on the other hand,
does need a permanent, actual tape on which it can run 
back and forth in trying to compute the answer. Surpris- 
ingly enough, it turns out that despite the ability of back- 
wards reference, two-way automata are no more power- 
ful than one-way automata. In terms of machine memory 
this means that all information relevant to a computation 
which an automaton can gather by backward reference 
can always be handled by a finite memory in a one-way 
machine. 

In Chapter III we study multitape machines. These 
automata can read symbols on several different tapes, 
and we adopt the convention that a machine will read 
for a while on one tape, then change control and read on 
another tape, and so on. Thus, with a two-tape machine, 
a set of pairs of tapes is defined, or we can say a binary 
relation between tapes is defined. Using again the power- 
ful tool of nondeterministic automata, we establish a 
relationship between two-tape automata and one-tape 
automata. Namely, the domain and range of a relation 
defined by a two-tape automaton are sets of tapes defina- 
ble by one-tape automata. From this follows the fact 
that, unlike the sets definable by one-tape automata, the
relations definable by two-tape automata do not form a
Boolean algebra. The problems whether a two-tape au- 
tomaton accepts any pair of tapes and whether it accepts 
an infinite number of pairs are shown to be solvable by 
effective algorithms. 

We conclude with a brief discussion of two-way, two- 
tape automata. Here even the problem whether an au- 
tomaton accepts any tapes at all is not solvable by an 
effective algorithm. Furthermore a reduction of two-way 
automata to one-way automata is not possible. All in all, 
there is a marked difference between the properties of 
one-tape automata and those of two-tape automata. The 
study of the latter is yet far from completion. 


Chapter I. One-tape, one-way automata 
• 1. The intuitive model and basic definitions


An automaton will be considered as a black box of which 
questions can be asked and from which a “yes” or “no” 
answer is obtained. The number of questions that can be 
asked will be infinite, and for simplicity a question is in- 
terpreted as any arbitrary finite sequence of symbols 
from a finite alphabet given in advance. An easy way to 
imagine the act of asking the question of the automaton 
is to think of the black box as having the separate sym- 
bols on a typewriter keyboard. Then the machine is 
turned on and the question is typed in; after an “end of
question” button is pressed, a light indicates a “yes” or 
“no” answer. Other good images of how the automaton 
could appear physically would use punched cards. Sup- 
pose that we punch just one symbol or code number for 
a symbol to a card; then a question is simply a stack of 
cards. The automaton is asked a question by having the
stack read in a card at a time in the usual way. 

For the purposes of this paper, we shall not use either 
of the above images but rather think of the questions as 
given on one-dimensional tapes. The machine will be 
endowed with a reading head which can read one square 
of the tape (i.e., one symbol) at a time, and then it can 
advance the tape one unit and read, say, the next square 
to the right. We assume the machine stops when it runs 
out of tape. So much for the external character of an 
automaton. 

The internal workings of an automaton will not be 
analyzed too deeply. We are not concerned with how the 
machine is built but with what it can do. The definition 
of the internal structure must be general enough to cover 
all conceivable machines, but it need not involve itself 
with problems of circuitry. The simple method of ob- 
taining generality without unnecessary detail is to use 
the concept of internal states. No matter how many wires 
or tubes or relays the machine contains, its operation is 
determined by stable states of the machine at discrete 
time intervals. An actual existing machine may have 
billions of such internal states, but the number is not 
important from the theoretical standpoint—only the fact 
that it is finite. 

As a further simplifying device, we need not consider 
all the intermediate states that the machine passes 
through but only those directly preceding the reading of
a symbol. That is, the machine first reads a symbol or 
square on the tape, then it may pass through several 
states before it is ready to read the next symbol. To be 
able to mimic the action of the automaton, we need not 
remember all these intermediate states but only the last 
one it goes into before it reads the next square. In fact,
if we make a table of all the transitions from a state and 
a symbol to a new state, then the whole action of the 
machine is essentially described. 

Finally, to get the answer from the machine, we need 
only distinguish between those states in which the “yes” 
light is on and those states in which the “no” light is on 
when the end of the question is reached. Again, for simplicity,
it is assumed that all states are in one category or
the other but not in both. Thus the whole machine is 
described when a class of designated states corresponding 
to the “yes” answers is given. It remains now to give a 
precise mathematical form to these ideas. 

First a finite alphabet $\Sigma$ is given and fixed for the rest
of the discussion. The actual number of symbols in the
alphabet is not important. It is only important that all
the automata considered use the same alphabet so that
different machines can be compared. For illustration we
shall often think of $\Sigma$ as containing only the two symbols
0 and 1. By a tape we shall understand any finite sequence
of symbols from $\Sigma$. We also include the empty
tape with no symbols, to be denoted by $\Lambda$. The class of
all tapes is denoted by $T$. If $x$ and $y$ are tapes in $T$, then
$xy$ denotes the tape obtained by splicing $x$ and $y$ together
or by juxtaposing or concatenating the two sequences. In
other words, if

$$x = \sigma_0 \sigma_1 \cdots \sigma_{n-1}$$

and

$$y = \tau_0 \tau_1 \cdots \tau_{m-1},$$

then

$$xy = \sigma_0 \sigma_1 \cdots \sigma_{n-1} \tau_0 \tau_1 \cdots \tau_{m-1},$$

where the $\sigma$'s and $\tau$'s are in $\Sigma$. We assume as obvious the
two laws

$$\Lambda x = x \Lambda = x$$

and

$$x(yz) = (xy)z,$$

for all $x$, $y$, $z$ in $T$. In mathematical terminology, $T$ together
with the operation of juxtaposition forms the free
semigroup (with unit) generated by $\Sigma$.

We shall often have occasion to cut tapes into pieces.
For example, let

$$x = \sigma_0 \sigma_1 \cdots \sigma_{n-1},$$

where the $\sigma$'s are in $\Sigma$ and $n$ is referred to as the length of the
tape $x$. We adopt the following notation:

$${}_k x_l = \sigma_k \sigma_{k+1} \cdots \sigma_{l-1},$$

where $k \le l \le n$. In other words, ${}_k x_l$ is a section of $x$ running
from the $(k+1)$st symbol of $x$ through the $l$th
symbol. Clearly, the length of ${}_k x_l$ is $l - k$. We will agree
that if $k = l$, then ${}_k x_l = \Lambda$, the tape of length 0. As a useful
property of the notation, we have

$$x = {}_0 x_k \, {}_k x_n,$$

where $k \le n$, or more generally

$${}_k x_m = {}_k x_l \, {}_l x_m,$$

where $k \le l \le m \le n$.

We shall refer to such tapes as ${}_0 x_k$ as the initial section
or initial portion of $x$ of length $k$.

The obvious notation $x^n$ for $xx \cdots x$ multiplied together
$n$ times will also be used, with the convention that
$x^0 = \Lambda$.
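The tape notation can be mirrored directly with strings. The following is a minimal Python sketch, assuming tapes are modeled as `str` values and the empty tape as `""`; the helper name `section` is hypothetical and not part of the paper.

```python
# Tapes are modeled as Python strings; "" stands for the empty tape.
EMPTY = ""


def section(x: str, k: int, l: int) -> str:
    """Return the section of x from the (k+1)st through the l-th symbol."""
    if not 0 <= k <= l <= len(x):
        raise ValueError("need 0 <= k <= l <= len(x)")
    return x[k:l]


x = "01101"
n = len(x)

# The section from k to l has length l - k, and k == l gives the empty tape.
assert section(x, 1, 4) == "110"
assert len(section(x, 1, 4)) == 4 - 1
assert section(x, 2, 2) == EMPTY

# Splitting at any k and splicing the pieces back together recovers x.
assert all(section(x, 0, k) + section(x, k, n) == x for k in range(n + 1))

# Repetition: x * m is x spliced with itself m times, and x * 0 is the empty tape.
assert x * 0 == EMPTY and x * 2 == x + x
```

Python's half-open slices happen to match the convention used here, in which the section from k to l starts at the (k+1)st symbol and has length l - k.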

Having explained all the notations for the tapes that
will be fed into the machines, we turn now to the formal 
definition of an automaton. 

Definition 1. A (finite) automaton over the alphabet
$\Sigma$ is a system $\mathfrak{A} = (S, M, s_0, F)$, where $S$ is a finite non-empty
set (the internal states of $\mathfrak{A}$), $M$ is a function defined
on the Cartesian product $S \times \Sigma$ of all pairs of states
and symbols with values in $S$ (the table of transitions or
moves of $\mathfrak{A}$), $s_0$ is an element of $S$ (the initial state of
$\mathfrak{A}$), and $F$ is a subset of $S$ (the designated final states
of $\mathfrak{A}$).
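One possible way to hold the four components of Definition 1 in a program is sketched below. The class name `Automaton`, its field names, and the example machine `EVEN_ONES` are assumptions made for illustration; the paper itself prescribes no concrete representation.

```python
from dataclasses import dataclass


@dataclass
class Automaton:
    states: frozenset   # S: finite, non-empty set of internal states
    moves: dict         # M: maps (state, symbol) pairs to states
    start: object       # s_0: the initial state
    finals: frozenset   # F: the designated final states

    def check(self, alphabet) -> None:
        """Raise ValueError unless the components satisfy Definition 1."""
        if not self.states:
            raise ValueError("S must be non-empty")
        if self.start not in self.states:
            raise ValueError("s_0 must be an element of S")
        if not self.finals <= self.states:
            raise ValueError("F must be a subset of S")
        for s in self.states:
            for a in alphabet:
                if (s, a) not in self.moves or self.moves[(s, a)] not in self.states:
                    raise ValueError(f"M is not defined into S at {(s, a)!r}")


# Illustrative machine over {0, 1}: state "e" after an even number of 1s,
# state "o" after an odd number of 1s.
EVEN_ONES = Automaton(
    states=frozenset({"e", "o"}),
    moves={("e", "0"): "e", ("e", "1"): "o", ("o", "0"): "o", ("o", "1"): "e"},
    start="e",
    finals=frozenset({"e"}),
)
EVEN_ONES.check("01")  # passes silently
```

The `check` method only enforces what the definition states: M must be total on all pairs of states and symbols, and both the initial state and the final states must come from S.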

Let $\mathfrak{A}$ be an automaton. First of all, the function $M$
can be extended from $S \times \Sigma$ to $S \times T$ in a very natural
way by a definition by recursion as follows:

$$M(s, \Lambda) = s, \quad \text{for } s \text{ in } S;$$
$$M(s, x\sigma) = M(M(s, x), \sigma), \quad \text{for } s \text{ in } S,\ x \text{ in } T, \text{ and } \sigma \text{ in } \Sigma.$$

The meaning of $M(s, x)$ is very simple: it is that state
of the machine obtained by beginning in state $s$ and reading
through the whole tape $x$ symbol by symbol, changing
states according to the given table of moves. It should
be at once apparent from the definition of the extension
of $M$ just given that we have the following useful property:

$$M(s, xy) = M(M(s, x), y), \quad \text{for all } s \text{ in } S \text{ and } x, y \text{ in } T.$$


We may now easily define the set of those tapes which 
cause the automaton to give a “yes” answer. 

Definition 2. The set of tapes accepted or defined by
the automaton $\mathfrak{A}$, in symbols $T(\mathfrak{A})$, is the collection of
all tapes $x$ in $T$ such that $M(s_0, x)$ is in $F$.

Definition 3. The class of all definable sets of tapes, in
symbols $\mathcal{T}$, is the collection of all sets of the form $T(\mathfrak{A})$
for some automaton $\mathfrak{A}$.
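The extended table of moves and the acceptance condition of Definition 2 can be sketched as follows. The function names `extend` and `accepts`, the dictionary encoding of M, and the example machine are assumptions for illustration; the loop is simply the recursion for M unrolled from left to right.

```python
def extend(moves, s, x):
    """M(s, x): start in state s and read the tape x symbol by symbol."""
    for symbol in x:          # unrolls M(s, x sigma) = M(M(s, x), sigma)
        s = moves[(s, symbol)]
    return s                  # for the empty tape this is s itself


def accepts(moves, start, finals, x):
    """Definition 2: x is accepted when M(s_0, x) is in F."""
    return extend(moves, start, x) in finals


# Hypothetical example: tapes over {0, 1} with an even number of 1s.
MOVES = {("e", "0"): "e", ("e", "1"): "o", ("o", "0"): "o", ("o", "1"): "e"}

assert accepts(MOVES, "e", {"e"}, "0110")
assert not accepts(MOVES, "e", {"e"}, "0111")
assert accepts(MOVES, "e", {"e"}, "")      # the empty tape leaves the machine in s_0

# Spot check of the property M(s, xy) = M(M(s, x), y).
x, y = "011", "10"
assert all(extend(MOVES, s, x + y) == extend(MOVES, extend(MOVES, s, x), y) for s in "eo")
```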

The meaning of acceptance can be made clearer by a
diagram. Let $x = \sigma_0 \cdots \sigma_{n-1}$. For each $k \le n$, let

$$s_k = M(s_0, {}_0 x_k),$$

so that for $k > 0$ we have

$$s_k = M(s_{k-1}, \sigma_{k-1}).$$

The condition that $x$ be in $T(\mathfrak{A})$ is that $s_n$ be in $F$.
Each $s_k$ is the state of the machine $\mathfrak{A}$ after reaching the
$k$th symbol in the tape $x$. Thus if we write down the following
diagram:

$$\begin{array}{ccccccccccc}
 & \sigma_0 & & \sigma_1 & & \sigma_2 & & \cdots & & \sigma_{n-1} & \\
s_0 & & s_1 & & s_2 & & s_3 & \cdots & s_{n-1} & & s_n
\end{array}$$

we have a complete picture of the motion of the machine
$\mathfrak{A}$ across the tape $x$. It is very important to notice in this
picture that there is exactly one more internal state than
there are symbols on the tape, a fact that will be used
several times in Section 4.
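The diagram can be reproduced for a concrete tape with a few lines of code. This is a sketch under the assumption that the table of moves is a dictionary keyed by (state, symbol); the name `trace` and the example machine are hypothetical.

```python
def trace(moves, start, x):
    """Return the list [s_0, s_1, ..., s_n] of states visited while reading x."""
    states = [start]
    for symbol in x:
        states.append(moves[(states[-1], symbol)])
    return states


# Hypothetical example: tapes over {0, 1} with an even number of 1s.
MOVES = {("e", "0"): "e", ("e", "1"): "o", ("o", "0"): "o", ("o", "1"): "e"}

tape = "0110"
run = trace(MOVES, "e", tape)

# There is exactly one more state in the run than there are symbols on the tape.
assert len(run) == len(tape) + 1
# The tape is accepted exactly when the last state of the run is a final state.
assert run[-1] in {"e"}
```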


• 2. A mathematical characterization of definable sets


An automaton can be a very complicated object, and it is 
not clear exactly how complicated the sets definable by 
automata can become. In order to understand the nature 
of these definable sets, we will develop in this section a 
mathematically simple and completely intrinsic charac- 
terization of these sets, which shows exactly the effect of 
considering machines with only a finite number of in- 
ternal states. This “finiteness” condition is certainly the 
main feature of our study. 

Actually two different characterizations will be given, 
but they share a common feature of involving equiva- 
lence relations over the set T of all tapes. The reader is 
assumed familiar with the notion of an equivalence 
relation and equivalence classes. 

Definition 4. An equivalence relation R over the set T
of tapes is right invariant if whenever xRy, then xzRyz 
for all z in T. 

Clearly there is an analogous definition of left-invari- 
ant equivalence relations. 

Definition 5. An equivalence relation over the set T is 
a congruence relation if it is both right and left invariant. 

If $R$ is a congruence relation, then the formulas $xRz$
and $yRw$ always imply $xyRzw$. In consequence, if $[x]$ is
the equivalence class containing $x$, and $[y]$ is the equivalence
class containing $y$, then we can define unambiguously
the product of the two equivalence classes by the
equation

$$[x][y] = [xy].$$


In mathematical terms, the set of equivalence classes is 
said to be the quotient semigroup of T under the con- 
gruence relation R and is called a homomorphic image 
of T. There are many distinct homomorphic images of 
T, but we shall be most interested in those that are finite. 
Somewhat more generally we shall make use of equiva- 
lence relations satisfying the following definition. 

Definition 6. An equivalence relation over T is of 
finite index if there are only finitely many equivalence 
classes under the relation. 

With these definitions, we may now state the first result 
on characterizing definable sets. This theorem is due to 
J. R. Myhill and is published with his kind permission. 

Theorem 1. (Myhill) Let $U$ be a set of tapes. The following
three conditions are equivalent:

(i) $U$ is in $\mathcal{T}$;

(ii) $U$ is the union of some of the equivalence classes
of a congruence relation over $T$ of finite index;

(iii) the explicit congruence relation $\equiv$ defined by the
condition that for all $x, y$ in $T$, $x \equiv y$ if and only if for all
$z, w$ in $T$, whenever $zxw$ is in $U$, then $zyw$ is in $U$, and
conversely, is a congruence relation of finite index.

Proof: Assume (i) and in particular that $U = T(\mathfrak{A})$
for a suitable automaton $\mathfrak{A}$. Define a relation $R$ by the
condition that $xRy$ if and only if $M(s, x) = M(s, y)$ for
all $s$ in $S$. Clearly $R$ is an equivalence relation, but it is
also a congruence relation. For assume that $xRy$ and $z$
is any tape in $T$. Then

$$\begin{aligned}
M(s, xz) &= M(M(s, x), z) \\
&= M(M(s, y), z) \\
&= M(s, yz), \quad \text{for all } s \text{ in } S.
\end{aligned}$$

Thus $R$ is right invariant. Likewise

$$\begin{aligned}
M(s, zx) &= M(M(s, z), x) \\
&= M(M(s, z), y) \\
&= M(s, zy), \quad \text{for all } s \text{ in } S,
\end{aligned}$$

and $R$ is shown to be left invariant.

That $R$ is of finite index is a consequence of the fact
that if $x$ is a fixed tape and $r$ is the number of internal
states of $\mathfrak{A}$, then the expression $M(s, x)$ can assume at
most $r$ different values. Thus the number of equivalence
classes is at most $r^r$.

Finally, if $x$ is in $T(\mathfrak{A})$ and $xRy$, then $M(s_0, x) = M(s_0, y)$,
so that $y$ is in $T(\mathfrak{A})$ also. This remark shows
that $U = T(\mathfrak{A})$ is in fact the union of the equivalence
classes under $R$ of those tapes in $U$. We have thus shown
that (i) implies (ii).

Assume next that statement (ii) holds, and let $R$ now
stand for any congruence relation satisfying the conditions
mentioned in (ii). Consider the specific relation $\equiv$
defined in (iii) in terms of $U$. Let $x$ and $y$ be any tapes
such that $xRy$. Suppose that $zxw$ is in $U$. Now $R$ is a congruence
relation, so that $zxw \, R \, zyw$. On the other hand, $U$
is a union of equivalence classes. Thus $zyw$ must also be
in $U$. This argument actually shows that if $xRy$, then
$x \equiv y$. In other words, $\equiv$ is a relation making fewer distinctions
than the relation $R$. That $\equiv$ is a congruence
relation is a trivial consequence of its definition, so if $R$
is of finite index, then $\equiv$ must necessarily be of finite
index too. Hence, (ii) implies (iii).

Finally, assume that (iii) holds. We must define an
automaton $\mathfrak{A}$ such that $U = T(\mathfrak{A})$. To this end, let $S$ be
the set of equivalence classes under the congruence
relation $\equiv$. Define the function $M$ by the formula:

$$M([x], \sigma) = [x\sigma],$$

where the square brackets indicate the formation of
equivalence classes. Notice we need only the fact that
$\equiv$ is right invariant to see that the definition of $M$ is
unambiguous. Further, let $s_0 = [\Lambda]$, and finally let $F$ be
the set of all $[x]$ where $x$ is in $U$. It should be obvious
that $U$ is indeed a union of equivalence classes under $\equiv$.
A simple inductive argument shows that if $M$ is extended
in the way indicated in Section 1 to the set $S \times T$, then
$M([x], y) = [xy]$ for all $x, y$ in $T$. Thus we see at once
that $M(s_0, x) = M([\Lambda], x) = [x]$ is in $F$ if and only if $x$
is in $U$; in other words $U = T(\mathfrak{A})$, as was to be shown.
Hence, (iii) implies (i), and the proof of Theorem 1 is
complete.
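The relation R used in the step from (i) to (ii) identifies two tapes when they induce the same map from states to states, so its classes can be enumerated by collecting those maps. The sketch below does this by a breadth-first search; the function name, the dictionary encoding of M, and the requirement that states be sortable are assumptions made for illustration.

```python
from collections import deque


def transformation_classes(states, alphabet, moves):
    """Map each class of R (stored as the state map s -> M(s, x)) to a shortest tape x in it."""
    order = sorted(states)                 # assumes the states can be sorted
    identity = tuple(order)                # the map induced by the empty tape
    representative = {identity: ""}
    queue = deque([identity])
    while queue:
        f = queue.popleft()
        for a in alphabet:
            # If f is induced by the tape x, then g is induced by x followed by a.
            g = tuple(moves[(t, a)] for t in f)
            if g not in representative:
                representative[g] = representative[f] + a
                queue.append(g)
    return representative


# Hypothetical example: tapes over {0, 1} with an even number of 1s.
MOVES = {("e", "0"): "e", ("e", "1"): "o", ("o", "0"): "o", ("o", "1"): "e"}
classes = transformation_classes({"e", "o"}, "01", MOVES)

r = 2
assert len(classes) <= r ** r              # the bound from the proof
```

For this two-state machine only two of the four conceivable maps arise, so the bound of r to the power r is not attained here; it is an upper limit, not a typical value.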

The main trouble with Theorem 1 is that the number
of equivalence classes under the relation $\equiv$ can become
very large, as is indicated in the proof that (i) implies
(ii). To be more economical and to stay closer to the
simpler automata defining the set $U$, one should use only
right-invariant equivalence relations rather than demanding
congruence relations. The following theorem is formulated
in an exactly parallel fashion to Theorem 1 and
is essentially a simplification of a theorem by A. Nerode,
who used a somewhat more involved notion of automaton
than that adopted here. The principle is very useful
and was employed by J. C. Shepherdson in a proof of
the main theorem of Section 7, as is explained there.

Theorem 2. (Nerode) Let $U$ be a set of tapes. The
following three conditions are equivalent:

(i) $U$ is in $\mathcal{T}$;

(ii) $U$ is the union of some of the equivalence classes
of a right-invariant equivalence relation over $T$ of
finite index;

(iii) the explicit right-invariant equivalence relation
$E$ defined by the condition that for all $x, y$ in $T$, $xEy$ if
and only if for all $z$ in $T$, whenever $xz$ is in $U$, then $yz$ is
in $U$, and conversely, is an equivalence relation of finite
index.

The proof need not be given in detail because it can
be copied almost word for word from the proof of
Theorem 1. It should only be mentioned that the relation
$R$ in the proof that (i) implies (ii) has the simpler
definition:

$$xRy \text{ if and only if } M(s_0, x) = M(s_0, y).$$

This implies that the number of equivalence classes for
$R$ is at most the number of internal states of $\mathfrak{A}$. This remark
and an analysis of the full proof lead directly to
the following corollary.

Corollary 2.1. If $U$ is in $\mathcal{T}$, then the number of equivalence
classes under the relation $E$ is the least number of
internal states of any automaton defining $U$.

In other words, the relation $E$ leads at once to the
most economical automaton defining $U$. This remark
is also due to Nerode.
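Corollary 2.1 suggests computing the index of E from a given automaton. One standard way to do this, which is not taken from the paper, is partition refinement: discard the states that cannot be reached from the initial state, then repeatedly split the remaining states according to the blocks their successors fall into. The sketch below follows that idea; the function name and the dictionary encoding of M are assumptions.

```python
def minimal_state_count(alphabet, moves, start, finals):
    """Number of classes of E for the set defined by the given machine."""
    # Keep only the states reachable from the initial state.
    reachable, stack = {start}, [start]
    while stack:
        s = stack.pop()
        for a in alphabet:
            t = moves[(s, a)]
            if t not in reachable:
                reachable.add(t)
                stack.append(t)
    # Start from the split into final and non-final states, then refine.
    block = {s: int(s in finals) for s in reachable}
    count = len(set(block.values()))
    while True:
        ids, refined = {}, {}
        for s in reachable:
            # States stay together only if they and all their successors agree.
            signature = (block[s],) + tuple(block[moves[(s, a)]] for a in alphabet)
            refined[s] = ids.setdefault(signature, len(ids))
        if len(ids) == count:              # no block was split: the partition is stable
            return count
        block, count = refined, len(ids)


# Hypothetical machine counting 1s modulo 4 and accepting on 0 or 2,
# which defines the tapes with an even number of 1s using four states.
MOD4 = {(s, a): (s + int(a)) % 4 for s in range(4) for a in "01"}
assert minimal_state_count("01", MOD4, 0, {0, 2}) == 2
```

The four-state machine is therefore not the most economical one: two states suffice, matching the two classes of E for this set.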

As a simple application of Theorem 1, we shall show
that the set $U$ of all tapes of the form $0^n 1 0^n$ for
$n = 0, 1, 2, \ldots$ is not definable by any automaton. Suppose
to the contrary that $U$ is in $\mathcal{T}$. Consider the relation $\equiv$
of Theorem 1 (iii). This relation would have to be of
finite index, so that for some integers $n \ne m$ we would
have $0^n \equiv 0^m$. It follows at once that $0^n 1 0^n \equiv 0^m 1 0^n$,
and hence that $0^m 1 0^n$ is in $U$, which is impossible.
Thus $U$ cannot be in $\mathcal{T}$.
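The heart of this argument is that two tapes consisting of different numbers of 0s can always be told apart by a suitable continuation. The small check below illustrates this for exponents under 8; the function names are hypothetical, and the bound 8 is an arbitrary choice made only to keep the check finite.

```python
def in_U(t: str) -> bool:
    """Membership test for U, the set of tapes consisting of n 0s, a 1, and n 0s."""
    n = t.find("1")
    return n >= 0 and t == "0" * n + "1" + "0" * n


def separates(n: int, m: int) -> bool:
    """True when appending a 1 and n 0s puts the tape of n 0s in U but not the tape of m 0s."""
    w = "1" + "0" * n
    return in_U("0" * n + w) and not in_U("0" * m + w)


# Every pair of distinct exponents below 8 is separated, so the tapes of
# 0, 1, ..., 7 zeros lie in eight different classes. The same check
# succeeds for any bound, which is why the index cannot be finite.
assert all(separates(n, m) for n in range(8) for m in range(8) if n != m)
```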


• 3. Closure properties of the class of definable sets


Using the theorems just given in the preceding section,
we can derive very simply some facts about the class $\mathcal{T}$.
It turns out that $\mathcal{T}$ can actually be characterized by its
closure properties under some natural operations on sets
of tapes, but the discussion of this fact will be delayed to
Section 6. Sometimes it is easier to use Theorems 1 and
2 and sometimes it is easier to give direct constructions
of machines. In this section we shall indicate how the
Boolean operations can be done in both ways. First,
however, we prove two theorems that seem to be easier
by the indirect method.

Theorem 3. If $x$ is in $T$, then $\{x\}$, the set consisting
only of $x$, is in $\mathcal{T}$.

Proof: Clearly an automaton can be built which recognizes
one and only one tape given in advance; however,
Theorem 2 is easier to apply. The relation $E$ defined
in Theorem 2 (iii) in terms of $U = \{x\}$ simply
means that $yEz$ if and only if whenever either $y$ or $z$ is an initial
segment of the tape $x$, then $y = z$. Thus $E$ has one equivalence
class for each initial segment of $x$ and one extra
equivalence class for all the rest of the tapes. Obviously
$E$ then is of finite index, which completes the proof.
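The classes of E described in the proof can be used directly as the states of a machine for the single tape x: one state per initial segment, plus one extra state for every other tape. The following sketch builds that machine; the function names and the integer encoding of the states are assumptions for illustration.

```python
from itertools import product


def singleton_automaton(x, alphabet):
    """Return (states, moves, start, finals) for a machine accepting exactly the tape x."""
    n = len(x)
    dead = n + 1                       # the one extra class: tapes that are not initial segments
    moves = {}
    for s in range(n + 2):
        for a in alphabet:
            # State s <= n means "the initial segment of length s has been read".
            moves[(s, a)] = s + 1 if s < n and x[s] == a else dead
    return set(range(n + 2)), moves, 0, {n}


def accepts(moves, start, finals, tape):
    s = start
    for a in tape:
        s = moves[(s, a)]
    return s in finals


states, moves, start, finals = singleton_automaton("011", "01")
assert len(states) == len("011") + 2

# Among all tapes of length at most 5, only "011" itself is accepted.
accepted = ["".join(t) for n in range(6) for t in product("01", repeat=n)
            if accepts(moves, start, finals, t)]
assert accepted == ["011"]
```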


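If $x$ is any tape, then it can be turned end-for-end and
written backwards. Let $x^*$ stand for the result of writing
$x$ backwards, so that if $x = \sigma_0 \sigma_1 \cdots \sigma_{n-1}$, then
$x^* = \sigma_{n-1} \sigma_{n-2} \cdots \sigma_0$. Clearly we have the rules:

$$\sigma^* = \sigma, \quad \text{for } \sigma \text{ in } \Sigma,$$
$$\Lambda^* = \Lambda,$$
$$x^{**} = x,$$

and

$$(xy)^* = y^* x^*.$$

In case $U$ is any set of tapes, $U^*$ will denote the set of
all $x^*$ where $x$ is in $U$.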

The motion of an automaton, according to the defini- 
tions of Section 1, is always from left to right. Thus from 
the original definition, the following result is a little 
surprising. 

Theorem 4. If $U$ is in $\mathcal{T}$, then $U^*$ is in $\mathcal{T}$.

Proof: The content of the theorem is that if a set of
tapes is definable, then so is the set obtained by writing
all the defined tapes backwards. The direct construction
of a machine defining $U^*$ from a given machine defining
$U$ is rather lengthy, but Theorem 1 makes the result almost
obvious. Let $\equiv$ be the relation defined in terms of $U$
from Theorem 1 (iii) and let $\equiv^*$ be the analogous relation
for $U^*$. Assume that $x \equiv^* y$. If $zx^*w$ is in $U$, then
$(zx^*w)^*$ is in $U^*$. But $(zx^*w)^* = w^* x z^*$. Hence, $w^* y z^*$
is in $U^*$ also; however, $w^* y z^* = (zy^*w)^*$, and so $zy^*w$
is in $U$. This shows that $x^* \equiv y^*$. Since $U^{**} = U$, this argument
with $U$ and $U^*$ interchanged is also valid, and
we have proved that $x \equiv^* y$ if and only if $x^* \equiv y^*$, for
all $x, y$ in $T$. Clearly then, if $\equiv$ is of finite index, then $\equiv^*$
must be also of finite index with the same number of
equivalence classes, which completes the proof.

Theorem 5. The class $\mathcal{T}$ is a Boolean algebra of sets.

Proof: That the class $\mathcal{T}$ is closed under complements is
the most obvious fact, even from the original definition.
For if $U = T(\mathfrak{A})$ where $\mathfrak{A} = (S, M, s_0, F)$, then
$T - U = T(\mathfrak{B})$, where $\mathfrak{B} = (S, M, s_0, S - F)$. One need only prove
in addition that $\mathcal{T}$ is closed under intersections. Suppose
that $U_1$ and $U_2$ are in $\mathcal{T}$. By Theorem 2, let $R_1$ and $R_2$
be two right-invariant equivalence relations of finite index
such that $U_i$ is a union of equivalence classes under
$R_i$ for $i = 1, 2$. Consider the equivalence relation
$R_3 = R_1 \cap R_2$; in other words, $xR_3y$ if and only if $xR_1y$ and
$xR_2y$. $R_3$ is, of course, right invariant. Every equivalence
class under $R_3$ is an intersection of equivalence classes
under $R_1$ and $R_2$. Hence, the number of equivalence
classes for $R_3$ is at most the product of the numbers for
$R_1$ and $R_2$. We see, then, that $R_3$ is of finite index. Now
$U_1 \cap U_2$ is simply a union of intersections of the two
kinds of equivalence classes, so that $U_1 \cap U_2$ is a union
of equivalence classes under $R_3$, which shows that
$U_1 \cap U_2$ is in $\mathcal{T}$ by Theorem 2. The proof is complete.
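The complement step of this proof amounts to keeping the machine and exchanging the roles of final and non-final states. A minimal sketch, assuming automata are given as tuples of a state set, a dictionary of moves, an initial state and a set of final states (the names are hypothetical):

```python
from itertools import product


def complement(states, moves, start, finals):
    """Same machine with the designated final states replaced by S - F."""
    return states, moves, start, states - finals


def accepts(automaton, tape):
    _, moves, s, finals = automaton
    for a in tape:
        s = moves[(s, a)]
    return s in finals


# Hypothetical example: tapes over {0, 1} with an even number of 1s.
EVEN_ONES = ({"e", "o"},
             {("e", "0"): "e", ("e", "1"): "o", ("o", "0"): "o", ("o", "1"): "e"},
             "e", {"e"})
ODD_ONES = complement(*EVEN_ONES)

# On every tape of length at most 4 the two machines give opposite answers.
for n in range(5):
    for tape in product("01", repeat=n):
        assert accepts(ODD_ONES, tape) != accepts(EVEN_ONES, tape)
```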

Corollary 5.1. The class $\mathcal{T}$ contains all finite sets of
tapes.

This is a direct consequence of Theorems 3 and 5. 

The proof of Theorem 5 may seem too abstract. To 
make it more direct, we show next how to form at once
a machine defining the intersection. 

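Definition 7. Let $\mathfrak{A} = (S, M, s_0, F)$ and $\mathfrak{B} = (T, N, t_0, G)$
be two automata. The direct product $\mathfrak{A} \times \mathfrak{B}$ is that automaton
$(S \times T, M \times N, (s_0, t_0), F \times G)$ where $S \times T$ and
$F \times G$ are the Cartesian products of sets, $(s_0, t_0)$ is the
ordered pair of $s_0$ and $t_0$, and the function $M \times N$ on
$(S \times T) \times \Sigma$ is defined by the formula

$$(M \times N)((s, t), \sigma) = (M(s, \sigma), N(t, \sigma))$$

for all $s$ in $S$, $t$ in $T$, and $\sigma$ in $\Sigma$.

Theorem 6. If $\mathfrak{A}$ and $\mathfrak{B}$ are automata, then

$$T(\mathfrak{A} \times \mathfrak{B}) = T(\mathfrak{A}) \cap T(\mathfrak{B}).$$

Proof: An obvious inductive argument shows that for all
tapes $x$ we have $(M \times N)((s, t), x) = (M(s, x), N(t, x))$
for all $s$ in $S$ and $t$ in $T$. Now $x$ is in $T(\mathfrak{A} \times \mathfrak{B})$ if and
only if $(M \times N)((s_0, t_0), x) = (M(s_0, x), N(t_0, x))$ is in
$F \times G$. This in turn is equivalent to the conjunction of the
conditions that $M(s_0, x)$ is in $F$ and $N(t_0, x)$ is in $G$; in
other words, $x$ is in $T(\mathfrak{A}) \cap T(\mathfrak{B})$, as was to be shown.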
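Definition 7 translates almost literally into code. The sketch below assumes automata are given as tuples of a state set, a dictionary of moves, an initial state and a set of final states; the function names and the two example machines are hypothetical.

```python
from itertools import product


def direct_product(A, B, alphabet):
    """Definition 7 for automata given as (states, moves, start, finals) tuples."""
    (S, M, s0, F), (T, N, t0, G) = A, B
    states = set(product(S, T))
    moves = {((s, t), a): (M[(s, a)], N[(t, a)]) for (s, t) in states for a in alphabet}
    return states, moves, (s0, t0), set(product(F, G))


def accepts(automaton, tape):
    _, moves, s, finals = automaton
    for a in tape:
        s = moves[(s, a)]
    return s in finals


# Two illustrative machines over {0, 1}.
EVEN_ONES = ({"e", "o"},
             {("e", "0"): "e", ("e", "1"): "o", ("o", "0"): "o", ("o", "1"): "e"},
             "e", {"e"})
ENDS_IN_0 = ({"n", "z"},
             {("n", "0"): "z", ("n", "1"): "n", ("z", "0"): "z", ("z", "1"): "n"},
             "n", {"z"})

BOTH = direct_product(EVEN_ONES, ENDS_IN_0, "01")

# Spot check of Theorem 6 on all tapes of length at most 5.
for n in range(6):
    for tape in product("01", repeat=n):
        assert accepts(BOTH, tape) == (accepts(EVEN_ONES, tape) and accepts(ENDS_IN_0, tape))
```

The product machine has as many states as the product of the two state counts, which mirrors the bound on the number of classes of the intersected relation in the proof of Theorem 5.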


• 4. The emptiness problem


Suppose someone gave you an automaton $\mathfrak{A} = (S, M, s_0, F)$
without telling you what it was supposed to do. The gift
might turn out to be an elaborate practical joke, and
$T(\mathfrak{A})$ could very well be empty. Now a person would
not want to spend the rest of his life feeding all the in- 
finite number of possible tapes into the machine if all 
the answers are going to be the same. Thus one would 
like to know an upper bound on the number of tapes 
that need be tried to determine whether the machine is 
of any use. Such an upper bound is supplied by the next 
theorem. 

Theorem 7. Let $\mathfrak{A}$ be an automaton. Then $T(\mathfrak{A})$ is
not empty if and only if $\mathfrak{A}$ accepts some tape of length
less than the number of internal states of $\mathfrak{A}$.

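Proof: We need only establish the implication from
left to right. Assume that $T(\mathfrak{A})$ is not empty and indeed
that $x$ is a tape in $T(\mathfrak{A})$ of minimal length. Let $n$ be the
length of $x$ and let $r$ be the number of internal states of
$\mathfrak{A}$. By way of contradiction, assume that $r \le n$. Since the
$n + 1$ states $M(s_0, {}_0 x_k)$ for $k \le n$ then outnumber the $r$
internal states, they cannot all be distinct. It follows
at once that there must exist integers $k < l \le n$ such that

$$M(s_0, {}_0 x_k) = M(s_0, {}_0 x_l),$$

where ${}_0 x_k$ and ${}_0 x_l$ are the initial segments of $x$ of length
$k$ and $l$. Consider the tape $x' = {}_0 x_k \, {}_l x_n$, which is shorter
than $x$. We have

$$\begin{aligned}
M(s_0, x') &= M(s_0, {}_0 x_k \, {}_l x_n) \\
&= M(M(s_0, {}_0 x_k), {}_l x_n) \\
&= M(M(s_0, {}_0 x_l), {}_l x_n) \\
&= M(s_0, {}_0 x_l \, {}_l x_n) \\
&= M(s_0, x),
\end{aligned}$$

because $x = {}_0 x_l \, {}_l x_n$. Hence $x'$ must be in $T(\mathfrak{A})$ also, which
contradicts the minimum conditions on $x$ and proves
that $n < r$.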

Corollary 7.1. Given a finite automaton $\mathfrak{A}$, there is an
effective procedure whereby in a finite number of steps
it can be decided whether $T(\mathfrak{A})$ is empty.

The corollary is an immediate consequence of the fact
that Theorem 7 shows that only a finite number of tapes
need be tried, and any one tape can be run effectively
through a machine once the table of moves has been
given. It is also possible to give a simple necessary and
sufficient condition of a similar nature for $T(\mathfrak{A})$ to be
infinite. We precede that result by a lemma.
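The procedure behind Corollary 7.1 can be written down as a brute-force search over all tapes shorter than the number of states. This is only a sketch of that idea; the function name and the dictionary encoding of the table of moves are assumptions.

```python
from itertools import product


def is_empty(states, alphabet, moves, start, finals):
    """Decide whether the defined set is empty by trying every tape of length < r (Theorem 7)."""
    r = len(states)
    for n in range(r):
        for tape in product(alphabet, repeat=n):
            s = start
            for a in tape:
                s = moves[(s, a)]
            if s in finals:
                return False       # found an accepted tape
    return True


# Hypothetical example: tapes over {0, 1} with an even number of 1s.
MOVES = {("e", "0"): "e", ("e", "1"): "o", ("o", "0"): "o", ("o", "1"): "e"}
assert not is_empty({"e", "o"}, "01", MOVES, "e", {"e"})
assert is_empty({"e", "o"}, "01", MOVES, "e", set())

# A final state that can never be reached also gives the empty set.
STUCK = {("a", "0"): "a", ("a", "1"): "a", ("b", "0"): "b", ("b", "1"): "b"}
assert is_empty({"a", "b"}, "01", STUCK, "a", {"b"})
```

The number of tapes tried grows exponentially with the number of states, so this is a direct reading of the theorem and not an efficient test.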

Lemma 8. Let $\mathfrak{A}$ be an automaton with $r$ internal
states. Let $x$ be a tape in $T(\mathfrak{A})$ of length $n$. If $r \le n$, then
there exist tapes $y, z, w$ such that $x = yzw$, $z \ne \Lambda$, and all
the tapes $yz^m w$ are in $T(\mathfrak{A})$ for $m = 0, 1, 2, \ldots$.

Proof: As in Theorem 7, there must exist integers
$k < l \le n$ such that

$$M(s_0, {}_0 x_k) = M(s_0, {}_0 x_l).$$

Let $y = {}_0 x_k$, $z = {}_k x_l$, $w = {}_l x_n$. Since $k < l$, we see that $z \ne \Lambda$.
Clearly $x = yzw$, and $yz = {}_0 x_l$, hence $M(s_0, y) = M(s_0, yz)$.
It follows then at once by induction that $M(s_0, y) = M(s_0, yz^m)$.
Whence, we derive

$$\begin{aligned}
M(s_0, x) &= M(s_0, yzw) \\
&= M(M(s_0, yz), w) \\
&= M(M(s_0, yz^m), w) \\
&= M(s_0, yz^m w).
\end{aligned}$$

Thus all the tapes $yz^m w$ are also in $T(\mathfrak{A})$.
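The decomposition in Lemma 8 can be found by reading the tape and watching for the first state that repeats. The sketch below does that; the function name and the example machine are hypothetical, and the function returns `None` when no state repeats, which can only happen for tapes shorter than the number of states.

```python
def pump_decomposition(moves, start, x):
    """Return (y, z, w) with x = y + z + w, z non-empty, and M(s0, y) = M(s0, y + z), or None."""
    first_seen = {start: 0}            # state -> length of the initial segment that reached it
    s = start
    for l, a in enumerate(x, start=1):
        s = moves[(s, a)]
        if s in first_seen:            # the segments of length k and l reach the same state
            k = first_seen[s]
            return x[:k], x[k:l], x[l:]
        first_seen[s] = l
    return None


def accepts(moves, start, finals, tape):
    s = start
    for a in tape:
        s = moves[(s, a)]
    return s in finals


# Hypothetical example: tapes over {0, 1} with an even number of 1s.
MOVES = {("e", "0"): "e", ("e", "1"): "o", ("o", "0"): "o", ("o", "1"): "e"}
x = "0110"
y, z, w = pump_decomposition(MOVES, "e", x)

assert y + z + w == x and z != ""
# Repeating or deleting the middle piece keeps the tape in the defined set.
assert all(accepts(MOVES, "e", {"e"}, y + z * m + w) for m in range(5))
```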

Theorem 9. Let $\mathfrak{A}$ be an automaton with $r$ internal
states. Then $T(\mathfrak{A})$ is infinite if and only if it contains a
tape of length $n$ with $r \le n < 2r$.

Proof: The implication from right to left is a direct
consequence of Lemma 8. Assume that $T(\mathfrak{A})$ is infinite.
The alphabet $\Sigma$ is finite, and so $T(\mathfrak{A})$ must contain
tapes of length greater than any integer. Let $x$ be a tape
in $T(\mathfrak{A})$ of length $n \ge r$. As in the other two proofs,
there must exist integers $k < l \le n$ such that

$$M(s_0, {}_0 x_k) = M(s_0, {}_0 x_l).$$

Now take a new tape $x$ which is of minimal length of
any tape in $T(\mathfrak{A})$ for which integers $k < l$ exist satisfying
the above equation. Assume further that $l$ is the least
such integer $\le n$, where $n$ is the length of $x$. We no longer know
that $n \ge r$. Thus if $i < j < l$, then

$$M(s_0, {}_0 x_i) \ne M(s_0, {}_0 x_j).$$

Since there are at most $r$ values for the function $M$ to assume,
this proves that $l \le r$. Further, if $l \le i < j \le n$, then

$$M(s_0, {}_0 x_i) \ne M(s_0, {}_0 x_j),$$

since otherwise the tape $x' = {}_0 x_i \, {}_j x_n$ would be a shorter
tape than $x$ satisfying the given conditions on $x$. Counting
the number of indices between $l$ and $n$, we see that
$n - l + 1 \le r$. Adding $l$ to both sides and applying the
previous inequality, we find $n + 1 \le 2r$, or better, $n < 2r$.
If $r \le n$, then the proof would be complete; however, this
may not be the case. Assume that $n < r$. Let $y = {}_0 x_k$,
$z = {}_k x_l$, $w = {}_l x_n$. We have $z \ne \Lambda$, and all tapes $yz^m w$ are
in $T(\mathfrak{A})$. Let $m$ be the least integer such that

$$r \le k + m(l - k) + (n - l).$$

Clearly $m > 0$, since $k + (n - l) < n < r$. If
$2r \le k + m(l - k) + (n - l)$, then

$$r \le k + (m - 1)(l - k) + (n - l),$$

because $l - k \le n < r$. But this is impossible because $m$ was
chosen as the least such integer. Hence

$$k + m(l - k) + (n - l) < 2r$$

and the number on the left is the length of $yz^m w$, which
proves that there is some tape in $T(\mathfrak{A})$ of the indicated
length.

Corollary 9.1. Given a finite automaton $\mathfrak{A}$, there is an
effective procedure whereby in a finite number of steps
it can be decided whether $T(\mathfrak{A})$ is infinite.
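A direct reading of Theorem 9 gives the following test for infiniteness: look for an accepted tape of length $n$ with $r \le n < 2r$. The Python sketch below assumes a dictionary-based table of moves; `is_infinite`, `TWO_ONES` and `ONLY_ONE` are hypothetical names chosen for illustration.

```python
from itertools import product

def is_infinite(states, alphabet, delta, start, finals):
    """Decide whether T(A) is infinite by trying all tapes of length n
    with r <= n < 2r, where r is the number of internal states."""
    r = len(states)
    for n in range(r, 2 * r):
        for tape in product(alphabet, repeat=n):
            state = start
            for symbol in tape:
                state = delta[(state, symbol)]
            if state in finals:
                return True
    return False

# Hypothetical example 1: tapes with at least two 1s (an infinite set).
TWO_ONES = {("a", "0"): "a", ("a", "1"): "b",
            ("b", "0"): "b", ("b", "1"): "c",
            ("c", "0"): "c", ("c", "1"): "c"}
# Hypothetical example 2: only the tape 1 is accepted (a finite set).
ONLY_ONE = {("s", "0"): "d", ("s", "1"): "f",
            ("f", "0"): "d", ("f", "1"): "d",
            ("d", "0"): "d", ("d", "1"): "d"}
print(is_infinite({"a", "b", "c"}, "01", TWO_ONES, "a", {"c"}))
print(is_infinite({"s", "f", "d"}, "01", ONLY_ONE, "s", {"f"}))
```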


Corollary 9.2. Let $\mathfrak{A}$ be a finite automaton with $r$
internal states, and let the alphabet $\Sigma$ have $q > 1$ symbols.
Then if $T(\mathfrak{A})$ is finite, it can have at most

$$\sum_{k<r} q^k = \frac{q^r - 1}{q - 1}$$

tapes.

Notice also that Lemma 8 gives another proof that
the set of tapes of the form $0^n 1 0^n$ is not definable by any
finite automaton.

Finally we shall treat in this section the question of 
deciding whether two automata define the same set of 
tapes. 

Definition 8. Two automata $\mathfrak{A}$ and $\mathfrak{B}$ are
equivalent if $T(\mathfrak{A}) = T(\mathfrak{B})$.

Theorem 10. Two automata $\mathfrak{A}$ and $\mathfrak{B}$ are not
equivalent if and only if there is a tape $x$ of length less than the
product of the number of internal states of $\mathfrak{A}$ by that of
$\mathfrak{B}$ which is accepted by one machine but not by the other.

Proof: Let $\mathfrak{A}'$ be the machine having the same internal
states as $\mathfrak{A}$ and defining the complement of $T(\mathfrak{A})$
as in the proof of Theorem 5. Similarly for $\mathfrak{B}'$.
$\mathfrak{A}$ and $\mathfrak{B}$ are not equivalent if and only if one
of the sets $T(\mathfrak{A} \times \mathfrak{B}')$,
$T(\mathfrak{A}' \times \mathfrak{B})$ is not empty. The theorem follows
now directly from Theorem 7, Theorem 6 and Definition 7.

Corollary 10.1. Given two finite automata $\mathfrak{A}$ and
$\mathfrak{B}$, there is an effective procedure whereby in a finite
number of steps it can be decided whether $\mathfrak{A}$ and
$\mathfrak{B}$ are equivalent.
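Theorem 10 bounds the length of a distinguishing tape, so equivalence can be tested by comparing the two machines on all tapes shorter than the product of their state counts. The Python sketch below is an illustration only; each automaton is assumed to be a triple (table of moves, initial state, final states), and all names are hypothetical.

```python
from itertools import product

def accepts(dfa, tape):
    """Run an ordinary automaton given as (delta, start, finals)."""
    delta, state, finals = dfa
    for symbol in tape:
        state = delta[(state, symbol)]
    return state in finals

def distinguishing_tape(dfa_a, r_a, dfa_b, r_b, alphabet):
    """Return a tape of length < r_a * r_b accepted by exactly one machine,
    or None if there is none (the machines are then equivalent)."""
    for n in range(r_a * r_b):
        for tape in product(alphabet, repeat=n):
            if accepts(dfa_a, tape) != accepts(dfa_b, tape):
                return "".join(tape)
    return None

# Hypothetical examples: parity of the number of 1s, counted mod 2 and mod 4.
MOD2 = {(i, c): (i + int(c)) % 2 for i in range(2) for c in "01"}
MOD4 = {(i, c): (i + int(c)) % 4 for i in range(4) for c in "01"}
EVEN_A = (MOD2, 0, {0})      # even number of 1s, two states
EVEN_B = (MOD4, 0, {0, 2})   # the same set, four states
MULT4 = (MOD4, 0, {0})       # number of 1s divisible by four
print(distinguishing_tape(EVEN_A, 2, EVEN_B, 4, "01"))
print(distinguishing_tape(EVEN_A, 2, MULT4, 4, "01"))
```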


All the results of this section are quite evident from 
the literature, e.g., Burks-Wang,! Section 2.2. Only 
Theorem 10 and its corollary are a little stronger than 
the corresponding results there because of a wider defini- 
tion of equivalence of automata. These results are none- 
theless included for completeness, since the general ap- 
proach here is rather different. 


Chapter II. Reductions to one-way automata

5. Nondeterministic operation


The automata used throughout Chapter I were strictly 
deterministic in their tape-reading action, which was 
uniquely determined by the table of moves, since there 
was one and only one way the machine would change its 
state in any particular situation. Requiring all machines to 
be of this form can lead to rather cumbersome details, in 
view of the large number of internal states needed even 
for some relatively elementary operations. In this section 
we introduce the notion of a nondeterministic automa- 
ton and show that any set of tapes defined by such a 
machine could also be defined by an ordinary automa- 
ton. The main advantage of these machines is the small 
number of internal states that they require in many 
cases and the ease with which specific machines can be described.
Several examples of their use will be found in
Section 6. 

Definition 9. A nondeterministic (finite) automaton
over the alphabet $\Sigma$ is a system $\mathfrak{A} = (S, M, S_0, F)$
where $S$ is a finite set, $M$ is a function of $S \times \Sigma$ with
values in the set of all subsets of $S$, and $S_0$ and $F$ are subsets
of $S$.

A nondeterministic automaton is not a probabilistic 
machine but rather a machine with many choices in its 
moves. At each stage of its motion across a tape it will 
be at liberty to choose one of several new internal states. 
Of course, some sequence of choices will lead either to 
impossible situations from which no moves are possible 
or to final states not in the designated class F. We disre- 
gard all such failures, however, and agree to let the ma- 
chine accept a tape if there is at least one winning com- 
bination of choices of states leading to a designated final 
state. The next definition makes this convention precise. 

Definition 10. Let $\mathfrak{A}$ be a nondeterministic automaton.
The set $T(\mathfrak{A})$ of tapes accepted by $\mathfrak{A}$ is the
collection of all tapes $x = \sigma_0\sigma_1 \ldots \sigma_{n-1}$ for
which there exists a sequence $s_0, s_1, \ldots, s_n$ of internal states
of $\mathfrak{A}$ such that
(i) $s_0$ is in $S_0$;
(ii) $s_i$ is in $M(s_{i-1}, \sigma_{i-1})$, for $i = 1, 2, \ldots, n$;
(iii) $s_n$ is in $F$.
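Definition 10 asks for the existence of one successful sequence of states. One way to test this, sketched below in Python, is to carry along the set of all states reachable by some sequence of choices. The representation (a dictionary `M` mapping (state, symbol) to a set of states, with missing entries read as the empty set) and the example machine are assumptions made for illustration.

```python
def nfa_accepts(M, S0, F, tape):
    """Return True if some sequence s_0, ..., s_n satisfies (i)-(iii)."""
    current = set(S0)  # states reachable after the symbols read so far
    for symbol in tape:
        current = {t for s in current for t in M.get((s, symbol), set())}
    return bool(current & set(F))

# Hypothetical example: tapes over {0, 1} whose next-to-last symbol is 1.
SECOND_LAST = {("p", "0"): {"p"}, ("p", "1"): {"p", "q"},
               ("q", "0"): {"r"}, ("q", "1"): {"r"}}
for tape in ["10", "110", "01", ""]:
    print(repr(tape), nfa_accepts(SECOND_LAST, {"p"}, {"r"}, tape))
```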

It is readily seen that if $\mathfrak{A}$ is a nondeterministic
machine such that $M(s, \sigma)$ consists of exactly one internal
state for each $s$ in $S$ and $\sigma$ in $\Sigma$, then $\mathfrak{A}$
is really the same as an ordinary automaton, and $T(\mathfrak{A})$ will
contain the expected tapes. Thus ordinary automata are special cases
of nondeterministic automata, and we shall freely identify
the ordinary machines with their counterparts.

One might imagine at first sight that these new machines
are more general than the ordinary ones, but this
is not the case. We shall give a direct construction of an
ordinary automaton, defining exactly the same set of
tapes as a given nondeterministic machine.

Definition 11. Let $\mathfrak{A} = (S, M, S_0, F)$ be a
nondeterministic automaton. $\mathfrak{D}(\mathfrak{A})$ is the system
$(T, N, t_0, G)$ where $T$ is the set of all subsets of $S$, $N$ is a
function on $T \times \Sigma$ such that $N(t, \sigma)$ is the union of
the sets $M(s, \sigma)$ for $s$ in $t$, $t_0 = S_0$, and $G$ is the set
of all subsets of $S$ containing at least one member of $F$.

Clearly $\mathfrak{D}(\mathfrak{A})$ is an ordinary automaton, but it is
actually equivalent to $\mathfrak{A}$.

Theorem 11. If $\mathfrak{A}$ is a nondeterministic automaton,
then $T(\mathfrak{A}) = T(\mathfrak{D}(\mathfrak{A}))$.

Proof: Assume first that a tape $x = \sigma_0\sigma_1 \ldots \sigma_{n-1}$
is in $T(\mathfrak{A})$ and let $s_0, s_1, \ldots, s_n$ be a sequence of
internal states satisfying the conditions of Definition 10. We
show by induction that for $k \le n$, $s_k$ is in $N(t_0, {}_0x_k)$. For
$k = 0$, $N(t_0, {}_0x_k) = N(t_0, \Lambda) = t_0 = S_0$ and we were given
that $s_0$ is in $S_0$. Assume the result for $k - 1$. By definition,
$N(t_0, {}_0x_k) = N(N(t_0, {}_0x_{k-1}), \sigma_{k-1})$. But we have
assumed $s_{k-1}$ is in $N(t_0, {}_0x_{k-1})$ so that from the definition
of $N$ we have $M(s_{k-1}, \sigma_{k-1}) \subseteq N(t_0, {}_0x_k)$.
However, $s_k$ is in $M(s_{k-1}, \sigma_{k-1})$, and so the result is
established. In particular $s_n$ is in $N(t_0, {}_0x_n) = N(t_0, x)$, and
since $s_n$ is in $F$, we have $N(t_0, x)$ in $G$, which proves that $x$
is in $T(\mathfrak{D}(\mathfrak{A}))$. Hence, we have shown that

$$T(\mathfrak{A}) \subseteq T(\mathfrak{D}(\mathfrak{A})).$$

Assume next that a tape $x = \sigma_0\sigma_1 \ldots \sigma_{n-1}$ is in
$T(\mathfrak{D}(\mathfrak{A}))$. Let for each $k \le n$,
$t_k = N(t_0, {}_0x_k)$. We shall work backwards. First, we know that
$t_n$ is in $G$. Let then $s_n$ be any internal state of $\mathfrak{A}$
such that $s_n$ is in $t_n$ and $s_n$ is in $F$. Since $s_n$ is in

$$t_n = N(t_0, {}_0x_n) = N(t_{n-1}, \sigma_{n-1}),$$

we have from the definition of $N$ that $s_n$ is in
$M(s_{n-1}, \sigma_{n-1})$ for some $s_{n-1}$ in $t_{n-1}$. But

$$t_{n-1} = N(t_0, {}_0x_{n-1}) = N(t_{n-2}, \sigma_{n-2}),$$

so that $s_{n-1}$ is in $M(s_{n-2}, \sigma_{n-2})$ for some $s_{n-2}$ in
$t_{n-2}$. Continuing in this way we may obtain a sequence
$s_n, s_{n-1}, s_{n-2}, \ldots, s_0$ such that $s_k$ is in $t_k$; $s_k$ is
in $M(s_{k-1}, \sigma_{k-1})$, for $k > 0$; and $s_n$ is in $F$. Since
$t_0 = S_0$, we also have $s_0$ in $S_0$, which proves that $x$ is in
$T(\mathfrak{A})$. Thus,
$T(\mathfrak{D}(\mathfrak{A})) \subseteq T(\mathfrak{A})$, which completes
the proof.
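The construction of Definition 11 can be sketched in Python. As a deliberate deviation from the definition, the sketch below builds only the subsets reachable from $t_0$ rather than all subsets of $S$; this does not change the set of accepted tapes. The dictionary representation and the names `determinize`, `dfa_accepts` and `SECOND_LAST` are assumptions for illustration.

```python
def determinize(M, S0, F, alphabet):
    """Build the part of D(A) = (T, N, t0, G) reachable from t0 = S0."""
    t0 = frozenset(S0)
    T, N, todo = {t0}, {}, [t0]
    while todo:
        t = todo.pop()
        for a in alphabet:
            # N(t, a) is the union of the sets M(s, a) for s in t.
            u = frozenset(x for s in t for x in M.get((s, a), ()))
            N[(t, a)] = u
            if u not in T:
                T.add(u)
                todo.append(u)
    G = {t for t in T if t & set(F)}  # subsets containing a member of F
    return T, N, t0, G

def dfa_accepts(N, t0, G, tape):
    """Run the ordinary automaton produced by determinize."""
    t = t0
    for symbol in tape:
        t = N[(t, symbol)]
    return t in G

# Hypothetical example: tapes over {0, 1} whose next-to-last symbol is 1.
SECOND_LAST = {("p", "0"): {"p"}, ("p", "1"): {"p", "q"},
               ("q", "0"): {"r"}, ("q", "1"): {"r"}}
T, N, t0, G = determinize(SECOND_LAST, {"p"}, {"r"}, "01")
print(len(T), dfa_accepts(N, t0, G, "0110"), dfa_accepts(N, t0, G, "0101"))
```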

This theorem has many interesting consequences. For 
example, it shows that any automaton with several ini- 
tial states can be replaced by an equivalent automaton 
with but one initial state. It would seem that the notions 
of final state and initial state should be dual in some 
sense. But one must be careful, because, as the reader 
may easily show for himself, with the alphabet $\Sigma = \{0, 1\}$
the set of all tapes of the form $0^n$ or $1^n$ cannot be defined
by any nondeterministic automaton with but one
designated final state. The correct notion of duality be- 
tween initial and final states is connected with the re- 
versal of right and left, as indicated in the next definition 
and theorem. 


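Definition 12. Let $\mathfrak{A} = (S, M, S_0, F)$ be a
nondeterministic automaton. The dual of $\mathfrak{A}$ is the machine
$\mathfrak{A}^* = (S, M^*, F, S_0)$ where the function $M^*$ is defined
by the condition

$s'$ is in $M^*(s, \sigma)$ if and only if $s$ is in $M(s', \sigma)$.

Notice that we have at once the equation $\mathfrak{A}^{**} = \mathfrak{A}$.
The relation between the sets defined by an automaton
and its dual is as follows.

Theorem 12. If $\mathfrak{A}$ is a nondeterministic automaton,
then $T(\mathfrak{A}^*) = T(\mathfrak{A})^*$.

Proof: In view of the equality $\mathfrak{A}^{**} = \mathfrak{A}$, we
need only show $T(\mathfrak{A}^*) \subseteq T(\mathfrak{A})^*$. Let
$x = \sigma_0\sigma_1 \ldots \sigma_{n-1}$ be a tape in
$T(\mathfrak{A}^*)$; we must show that $x^*$ is in $T(\mathfrak{A})$. Let
$s_0, s_1, \ldots, s_n$ be the sequence of internal states of
$\mathfrak{A}^*$ such that $s_0$ is in $F$, $s_n$ is in $S_0$ and $s_k$ is
in $M^*(s_{k-1}, \sigma_{k-1})$ for $k = 1, 2, \ldots, n$. Define a new
sequence $s'_0, s'_1, \ldots, s'_n$ by the equation $s'_k = s_{n-k}$ for
$k \le n$. Obviously, $s'_0$ is in $S_0$ and $s'_n$ is in $F$. Further,
for $k > 0$ and $k \le n$, $s'_{k-1} = s_{n-k+1}$ is in
$M^*(s_{n-k}, \sigma_{n-k})$, or in other words, $s_{n-k} = s'_k$ is in
$M(s'_{k-1}, \sigma_{n-k})$. Now defining a new sequence of symbols
$\sigma'_0, \sigma'_1, \ldots, \sigma'_{n-1}$ by the formula
$\sigma'_k = \sigma_{n-k-1}$, we see that $\sigma'_{k-1} = \sigma_{n-k}$
and $\sigma'_0\sigma'_1 \ldots \sigma'_{n-1} = x^*$. Thus, $x^*$ is in
$T(\mathfrak{A})$ as was to be proved.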
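The dual of Definition 12 amounts to reversing every entry of the table of moves and exchanging the initial and final sets. The Python sketch below illustrates this under an assumed dictionary representation; `dual`, `nfa_accepts` and the example machine are hypothetical names.

```python
def nfa_accepts(M, S0, F, tape):
    """Acceptance for a nondeterministic automaton (Definition 10)."""
    current = set(S0)
    for symbol in tape:
        current = {t for s in current for t in M.get((s, symbol), set())}
    return bool(current & set(F))

def dual(M, S0, F):
    """Return (M*, F, S0): s' is in M*(s, a) iff s is in M(s', a)."""
    M_star = {}
    for (s_prime, a), targets in M.items():
        for s in targets:
            M_star.setdefault((s, a), set()).add(s_prime)
    return M_star, set(F), set(S0)

# Hypothetical example: tapes over {0, 1} whose next-to-last symbol is 1.
SECOND_LAST = {("p", "0"): {"p"}, ("p", "1"): {"p", "q"},
               ("q", "0"): {"r"}, ("q", "1"): {"r"}}
M_star, init_star, final_star = dual(SECOND_LAST, {"p"}, {"r"})
# The dual should accept exactly the reversed tapes: second symbol is 1.
for tape in ["01", "0100", "10", "1"]:
    print(tape, nfa_accepts(M_star, init_star, final_star, tape))
```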

It should be noted that Theorem 12 together with 
Theorem 11 yields a direct construction and proof for 
Theorem 4 of Section 3 which was first proved by the 
indirect method of Theorem 1. In the next section we 
make heavy use of the direct constructions supplied by 
the nondeterministic machines to obtain results not 
easily apparent from the mathematical characterizations 
of Theorems 1 and 2. 




6. Further closure properties


Simplifying a result due originally to Kleene, Myhill in
unpublished work has shown that the class $\mathcal{T}$ can be
characterized as the least class of sets of tapes containing 
the finite sets and closed under some simple operations 
on sets of tapes. We indicate here a different proof using 
the method developed in the preceding section. 

First of all, we need to define the operations on sets of 
tapes. Let U and V be two sets of tapes. By the complex 
product UV of U and V we understand the collection 
of all tapes of the form xy with x in U and y in V. 
Clearly the product of sets satisfies the associative law: 


(UV)W=U(VW). 


This leads to the introduction of finite exponents where
we define $U^n = UU \ldots U$ ($n$ times) with the convention
that $U^0 = \{\Lambda\}$. Finally, if $U$ is a set of tapes we can
form the closure of $U$, in symbols $\mathrm{cl}(U)$, which is the
least set $V$ containing $U$, having $\Lambda$ as an element, and
such that whenever $x, y$ are in $V$ then $xy$ is in $V$. Another
definition is given by the equation

$$\mathrm{cl}(U) = U^0 \cup U^1 \cup U^2 \cup \ldots,$$

where the infinite union extends over all finite exponents.
We may prove at once about these operations that the
class $\mathcal{T}$ is closed under them.

Theorem 13. The class $\mathcal{T}$ is closed under the formation
of complex products and closures of sets in $\mathcal{T}$.

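Proof: Assume first that $U$, $V$ are in $\mathcal{T}$. Let
$U = T(\mathfrak{A})$ and $V = T(\mathfrak{B})$ where $\mathfrak{A}$ and
$\mathfrak{B}$ are ordinary automata with $\mathfrak{A} = (S, M, s_0, F)$
and $\mathfrak{B} = (T, N, t_0, G)$. We need only find a nondeterministic
machine $\mathfrak{C}$ such that $UV = T(\mathfrak{C})$. We may assume
that the sets $S$ and $T$ have no elements in common, and then equate
$\mathfrak{C} = (S \cup T, P, \{s_0\}, G)$ where the function $P$ is
defined as follows:

$$\begin{aligned}
P(s, \sigma) &= \{M(s, \sigma)\}, && \text{if } s \text{ is in } S - F; \\
P(s, \sigma) &= \{M(s, \sigma), N(t_0, \sigma)\}, && \text{if } s \text{ is in } F; \\
P(t, \sigma) &= \{N(t, \sigma)\}, && \text{if } t \text{ is in } T.
\end{aligned}$$

The straightforward proof that $\mathfrak{C}$ has the desired
property is left to the reader.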

Next, we must show that $\mathrm{cl}(U)$ is in $\mathcal{T}$. We
construct a machine $\mathfrak{D}$ such that
$\mathrm{cl}(U) = T(\mathfrak{D})$, where $\mathfrak{D}$ is allowed to be
nondeterministic. Simply let $\mathfrak{D} = (S, Q, s_0, F)$, where the
function $Q$ is defined as follows:

$$\begin{aligned}
Q(s, \sigma) &= \{M(s, \sigma)\}, && \text{if } s \text{ is in } S - F; \\
Q(s, \sigma) &= \{M(s, \sigma), M(s_0, \sigma)\}, && \text{if } s \text{ is in } F.
\end{aligned}$$

The easy completion of the proof is left to the reader.

Theorem 14. (Kleene-Myhill). The class $\mathcal{T}$ is the least
class of sets of tapes containing the finite sets and closed
under the formation of unions, complex products, and
closures of sets.

The full proof of Theorem 14 will not be given. Instead
we give a brief account of the method of proof
needed. Let $\mathcal{U}$ be the least class closed under the
operations mentioned in the theorem. That
$\mathcal{U} \subseteq \mathcal{T}$ is the content of Theorems 5, 5.1,
and 13. To prove that $\mathcal{T} \subseteq \mathcal{U}$, consider each
set in $\mathcal{T}$ to be of the form $T(\mathfrak{A})$, where
$\mathfrak{A}$ is nondeterministic, and proceed by a kind of induction
on $\mathfrak{A}$. In more precise terms, define the weight of
$\mathfrak{A}$, in symbols $|\mathfrak{A}|$, to be the sum of all the
cardinal numbers of the sets $M(s, \sigma)$ for all $s$ in $S$ and
$\sigma$ in $\Sigma$. Then by assuming that $T(\mathfrak{B})$ is in
$\mathcal{U}$ for all $\mathfrak{B}$ with $|\mathfrak{B}| < |\mathfrak{A}|$,
one can prove that $T(\mathfrak{A})$ is also in $\mathcal{U}$. The
details, however, are tiring.


This discussion completes our survey of the closure 
properties of the class of definable sets begun in Section 
3, and the authors are not aware of any other interesting 
operations on sets that can be effected by constructions 
of automata that we have not already indicated. The re- 
mainder of this paper will be therefore devoted to gen- 
eralizations of the notion of an automaton. 


7. Two-way automata


Trying to further generalize the notion of an automaton, 
we consider automata which are not confined to a strict 
forward motion across their tapes. This leads to the fol- 
lowing definition, which is a direct extension of Defini- 
tion 1. 

Definition 13. Let $L = \{-1, 0, +1\}$. A two-way (finite)
automaton over a finite alphabet $\Sigma$ is a system
$\mathfrak{A} = (S, M, s_0, F)$ where $S$ is a finite non-empty set (the
set of internal states of $\mathfrak{A}$), $M$ is a function from
$S \times \Sigma$ into $L \times S$ (the table of moves of
$\mathfrak{A}$), $s_0$ is an element of $S$ (the initial state of
$\mathfrak{A}$), and $F$ is a subset of $S$ (the set of designated final
states of $\mathfrak{A}$).

A two-way automaton $\mathfrak{A}$ operates as follows: When
given a tape, i.e., a finite linear sequence of squares each
containing a single symbol of the alphabet $\Sigma$, $\mathfrak{A}$ is
set in internal state $s_0$ scanning the first (leftmost) square of
the tape. At each stage of the machine's operation, if the
internal state is $s$, the scanned symbol is $\sigma$, and
$M(s, \sigma) = (p, s')$, where $p$ is one of $-1, 0, 1$, then
$\mathfrak{A}$ will move one square to the left, stay where it is, or
move one square to the right, according as $p = -1, 0, 1$; furthermore,
$\mathfrak{A}$ will enter internal state $s'$. The operation
described just now is called an atomic step of $\mathfrak{A}$. After
completion of an atomic step, $\mathfrak{A}$ is again in a certain
internal state scanning a certain symbol, and a new
atomic step is performed, and so on.

If, when operating in this way on a given tape, $\mathfrak{A}$ will
eventually get off the tape on the right side and at that
time be in a state in $F$, then we shall say that the tape is
accepted by $\mathfrak{A}$. The formal definition is as follows:

Definition 14. The set $T(\mathfrak{A})$ of tapes accepted by the
two-way automaton $\mathfrak{A}$ is the set of all sequences
$\sigma_0 \ldots \sigma_{n-1}$ of symbols from the alphabet $\Sigma$ for
which there exist an integer $m > 0$, a sequence of integers
$p_0, \ldots, p_m$, and a sequence $s_0, \ldots, s_m$ of internal states
of $\mathfrak{A}$ such that
(i) $p_0 = 0$ and $s_0$ is the initial state of $\mathfrak{A}$;
(ii) $0 \le p_i < n$ for $i = 0, \ldots, m-1$;
(iii) $p_m = n$ and $s_m$ is in $F$;
(iv) $(p_i - p_{i-1}, s_i) = M(s_{i-1}, \sigma_{p_{i-1}})$ for
$i = 1, \ldots, m$.

In the above definition the sequence $p_0, \ldots, p_m$
should be interpreted as the sequence of positions of the
machine $\mathfrak{A}$ on the tape; thus, $p_i - p_{i-1}$ indicates the
change in position of the machine from time $i - 1$ to time $i$.
Condition (ii), for example, means that the machine
does not run off the tape before the computation has
been completed.
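Definition 14 can be turned into a direct simulation. The Python sketch below adds one ingredient that is not in the definition: since the machine is deterministic, a repeated (position, state) pair means that it will never leave the tape, so the simulation stops and rejects. The representation of the table of moves and the example machines are assumptions for illustration.

```python
def two_way_accepts(M, s0, F, tape):
    """Simulate a two-way automaton; M maps (state, symbol) to (p, state)."""
    n = len(tape)
    if n == 0:
        return False  # condition (ii) with i = 0 requires 0 <= p_0 < n
    pos, state = 0, s0
    seen = set()
    while 0 <= pos < n:
        if (pos, state) in seen:
            return False  # same configuration again: the machine loops
        seen.add((pos, state))
        move, state = M[(state, tape[pos])]
        pos += move
    # Accept only when leaving on the right in a designated final state.
    return pos == n and state in F

# Hypothetical example: scan right over 0s, step back once at the first 1,
# then run off the right end. Tapes beginning with 1 fall off the left end.
BACKSTEP = {("q0", "0"): (+1, "q0"), ("q0", "1"): (-1, "q1"),
            ("q1", "0"): (+1, "q2"), ("q1", "1"): (+1, "q2"),
            ("q2", "0"): (+1, "q2"), ("q2", "1"): (+1, "q2")}
STUCK = {("s", "0"): (0, "s")}  # never moves, so it never accepts
for tape in ["01", "0010", "1", "00"]:
    print(tape, two_way_accepts(BACKSTEP, "q0", {"q2"}, tape))
print(two_way_accepts(STUCK, "s", {"s"}, "0"))
```

Under these assumptions the example machine accepts exactly the tapes that begin with 0 and contain a 1, a set that a one-way automaton also defines.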

In analogy with Definition 3 we shall say that a set $P$
of tapes is definable by a two-way automaton if there
exists some two-way automaton $\mathfrak{A}$ such that
$T(\mathfrak{A}) = P$.

To avoid confusion we shall, from now on, refer to 
the automata discussed in Sections 1-6 as one-way au- 
tomata. 

Let us consider an example of a two-way machine
illustrating the complicated fashion in which such a machine
can operate on a given tape. Let $\mathfrak{A}$, $\mathfrak{B}$, and
$\mathfrak{C}$ be three one-way automata over the same alphabet
$\Sigma$. We combine these automata into a single two-way automaton
$\mathfrak{D}$ having the following flow diagram. Given a tape $t$
the automaton $\mathfrak{A}$ (which we imagine as being a part of
$\mathfrak{D}$) starts reading it on the left end and proceeds from
left to right until a designated final state of $\mathfrak{A}$ is
reached; when this happens $\mathfrak{D}$ goes into the initial state of
$\mathfrak{B}$ and starts reading the tape from right to left until a
designated final state of $\mathfrak{B}$ is reached; when this happens
$\mathfrak{D}$ switches into the initial state of $\mathfrak{A}$ and
again starts moving from left to right, and so on; all this time
automaton $\mathfrak{C}$ is reading the tape symbols as they come in
(i.e., in the sequence in which they are being scanned by
$\mathfrak{D}$) and $t$ is accepted by $\mathfrak{D}$ only if
$\mathfrak{D}$ ever gets off the right-hand end of $t$ and at that time
$\mathfrak{C}$ is in one of its final designated states. It seems to be
quite difficult to determine the kind of set of tapes defined by
$\mathfrak{D}$. It turns out, to our surprise, that the following
theorem holds.

Theorem 15. For every two-way automaton $\mathfrak{A}$ there
exists a one-way automaton $\overline{\mathfrak{A}}$ such that
$T(\mathfrak{A}) = T(\overline{\mathfrak{A}})$. Furthermore,
$\overline{\mathfrak{A}}$ can be obtained effectively from
$\mathfrak{A}$.

Outline of Proof:* By definition, a Z-motion of $\mathfrak{A}$ on
a tape $t$ consists of $\mathfrak{A}$ moving across a square $x$ in a
certain direction up to a square $y$, changing direction at $y$
and moving back towards $x$, changing direction again
before passing $x$ and moving up to $y$; a Z-motion thus
contains exactly two changes of direction. While operating
on a tape $t$ a two-way automaton will in general
perform a complicated succession of forward and backward
motions before accepting or rejecting $t$. In particular,
$\mathfrak{A}$ will go through a great number of Z-motions.

In a given Z-motion in the diagram, the internal state $s'$ in
which $\mathfrak{A}$ re-enters $y$ is a function of the state $s$ in
which $\mathfrak{A}$ originally entered $y$ and the
portion of the tape from $x$ to $y$. If it were possible to
compute this new state $s'$, without actually having to
move back, then we could substitute for $\mathfrak{A}$ a new automaton
which, instead of turning back at $y$, would simply
go directly from state $s$ into state $s'$ and thus the Z-motion
would be eliminated. It turns out that the computation
of $s'$ from $s$ is indeed possible because the set
$R(s, s')$ ($L(s, s')$) of tapes such that when $\mathfrak{A}$ starts on
the right-(left-)hand end it will go through a simple loop
(i.e., move directly to some square, change direction
there, and go straight back to where it started) and arrive
back in state $s'$, is definable by a one-way automaton.
Combining $\mathfrak{A}$ with these one-way automata it is
possible to define a new derived automaton $\mathfrak{A}'$ which on
any given tape $t$ performs fewer Z-motions than $\mathfrak{A}$ does
and such that $T(\mathfrak{A}') = T(\mathfrak{A})$. We then show that, by
repeating this derivation operation a sufficient number
of times, a one-way automaton is obtained which defines
the same set as $\mathfrak{A}$. This depends on the fact that
there is a bound, common to all tapes $t$ accepted by $\mathfrak{A}$,
on the number of times $\mathfrak{A}$ goes through any square of $t$;
this bound being the number of internal states of $\mathfrak{A}$.

Corollary 15.1. The equivalence problem for two-way
automata is effectively solvable.

Proof: Given two two-way automata $\mathfrak{A}$ and $\mathfrak{B}$, to
decide whether $T(\mathfrak{A}) = T(\mathfrak{B})$ construct one-way
automata $\overline{\mathfrak{A}}$ and $\overline{\mathfrak{B}}$ such
that $T(\mathfrak{A}) = T(\overline{\mathfrak{A}})$ and
$T(\mathfrak{B}) = T(\overline{\mathfrak{B}})$; by the previous theorem
this can be done effectively. Apply now to $\overline{\mathfrak{A}}$ and
$\overline{\mathfrak{B}}$ the procedure given in Corollary 10.1.


*The result, with its original proof, was presented to the Summer
Institute of Symbolic Logic in 1957 at Cornell University. Subsequently
J. C. Shepherdson communicated to us a very elegant proof which
also appears in this Journal.” In view of this we confine ourselves
here to sketching the main ideas of our proof.


Chapter III. Multitape automata


8. Description and definitions


We turn now to the study of multitape machines, fixing
our attention, without any real loss of generality, on the
two-tape case. We can picture the two-tape machine
as having two scanning heads reading a pair $(t_1, t_2)$ of
tapes. We adopt the convention that the machine will
read for a while on one tape, then change control and
read for a while on the other tape, and so on until one
of the tapes is exhausted. When this happens $\mathfrak{A}$ stops and
the pair $(t_1, t_2)$ is accepted if and only if $\mathfrak{A}$ is in a
designated final state. Thus, with a two-tape automaton, a set
of pairs of tapes is defined, or we can say a binary relation
between tapes is defined.

To make two-tape automata more versatile we afford
them with the ability to anticipate the end of the tape.
This arrangement consists in augmenting the alphabet $\Sigma$
with an end-marker $\varepsilon$ and always feeding into the
automaton pairs of the form $(t_1\varepsilon, t_2\varepsilon)$; here
$t_1$ and $t_2$ do not contain $\varepsilon$, the latter being merely a
technical symbol.

In order to indicate the change of control from one 
tape to the other we use the device of dividing the states 
of the machine into two classes: the first class contains 
those states in which the first tape is being read, while 
the second class has to do with the second tape. These 
remarks should serve as sufficient background for the 
following formal definition. 

Definition 15. A two-tape, one-way automaton over
an alphabet $\Sigma$ is a system
$\mathfrak{A} = (S, M, s_0, F, C_0, C_1)$ where $(S, M, s_0, F)$ is an
ordinary automaton, except that $M$ is a function from
$S \times (\Sigma \cup \{\varepsilon\})$ into $S$, and where the sets
$C_0, C_1$ form a partition of $S$, i.e., $C_0 \cap C_1 = \emptyset$ and
$C_0 \cup C_1 = S$.

Thus a two-tape machine is just an ordinary automa- 
ton having an additional structure to determine which 
tape is to be read. 

To be able to define explicitly when a pair of tapes is 
accepted by an automaton, the following notation in- 
volving the partition of the set of states is needed. 

Let $\mathfrak{A} = (S, M, s_0, F, C_0, C_1)$ be a two-tape automaton
and let $s_0, s_1, \ldots, s_p$ be a sequence of states (where $s_0$
is the initial state). Then there is a unique pair of associated
sequences of integers $k_0, \ldots, k_p$; $l_0, \ldots, l_p$ such
that:

(i) $k_i$ is 0 or 1 according as $s_i$ is in $C_0$ or $C_1$;
(ii) $l_i$ is the number of indices $j < i$ such that $s_j$ is in
$C_{k_i}$.

Definition 16. The set of all pairs of tapes accepted
by a two-tape automaton $\mathfrak{A}$, in symbols $T_2(\mathfrak{A})$,
is the set of all pairs $(t_1, t_2)$ on the alphabet $\Sigma$ such that
for

$$(t_1\varepsilon, t_2\varepsilon) = (\sigma_{00}\sigma_{01} \ldots \sigma_{0(m-1)},\ \sigma_{10}\sigma_{11} \ldots \sigma_{1(n-1)})$$

there is a (unique) sequence of states $s_0, s_1, \ldots, s_p$ and
associated sequences of integers $k_0, \ldots, k_p$; $l_0, \ldots, l_p$
such that
(i) $s_0$ is the initial state of $\mathfrak{A}$;
(ii) $s_i = M(s_{i-1}, \sigma_{k_{i-1} l_{i-1}})$ for $i = 1, \ldots, p$;
(iii) if $k_{p-1} = 0$ then $l_{p-1} = m - 1$ and if $k_{p-1} = 1$
then $l_{p-1} = n - 1$;
(iv) $s_p$ is in $F$.

In the above definition we are, of course, assuming
that if $k_i = 0$, then $l_i < m$ and if $k_i = 1$, then $l_i < n$;
otherwise condition (ii) would be meaningless.
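The acceptance convention for two-tape automata can be sketched as a simulation. The Python below follows the informal description (the machine stops as soon as the last square of either tape, the end-marker, has been read) and is an interpretation rather than a transcription of Definition 16. The end-marker is written `#`, and the example machine, which checks whether the first tape is an initial segment of the second, is hypothetical.

```python
END = "#"  # stands in for the end-marker; assumed not to occur in the tapes

def two_tape_accepts(M, s0, F, C0, t1, t2):
    """Simulate a two-tape, one-way automaton on the pair (t1, t2).

    States in C0 read the first tape; all other states read the second."""
    tapes = (t1 + END, t2 + END)
    pos = [0, 0]  # number of squares already read on each tape
    state = s0
    while True:
        k = 0 if state in C0 else 1
        state = M[(state, tapes[k][pos[k]])]
        pos[k] += 1
        if pos[k] == len(tapes[k]):  # tape k is exhausted: stop here
            return state in F

# Hypothetical machine: read a symbol a from tape 1, then compare it with
# the next symbol of tape 2; reading the end-marker of tape 1 accepts.
ALPHABET = "01"
C0 = {"read1", "bad"}
M = {}
for a in ALPHABET + END:
    M[("read1", a)] = ("cmp", a)
    M[("bad", a)] = "bad"
    for b in ALPHABET + END:
        M[(("cmp", a), b)] = "read1" if a == b else "bad"
F = {("cmp", END)}
for pair in [("01", "011"), ("01", "0"), ("", ""), ("1", "0")]:
    print(pair, two_tape_accepts(M, "read1", F, C0, *pair))
```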


9. Relation to one-tape automata


Two-tape automata behave in a fashion almost identical
with that of one-tape automata, the only difference being
that they operate on two tapes. It is therefore natural to
try to establish relationships between the sets of pairs of
tapes definable by two-tape machines and the sets of
tapes definable by one-tape automata.

Theorem 16. Let $\mathfrak{A} = (S, M, s_0, F, C_0, C_1)$ be a two-tape
automaton. The set of all tapes $t_1$ for which there exists
some tape $t_2$ such that $(t_1, t_2)$ is in $T_2(\mathfrak{A})$ (i.e.,
the domain of the relation defined by $\mathfrak{A}$) is definable by a
one-tape automaton. An automaton defining this set can
in fact be constructed effectively from $\mathfrak{A}$.

Proof: The idea underlying the proof is that on the
first component of any pair of tapes $\mathfrak{A}$ operates like a
nondeterministic one-tape machine. Once we are able to
define the one-tape, nondeterministic automaton accepting
precisely the tapes $t_1$ for which $(t_1, t_2)$ is in
$T_2(\mathfrak{A})$ for some $t_2$, the proof is completed by Theorem 11.

To shorten the argument we shall consider a slightly
simplified version of the notion of two-tape automata;
namely, in Definitions 15 and 16 we disregard the end
symbol $\varepsilon$ and the special role it plays (it is possible to
extend the proof to cover the general case). A pair
$(t_1, t_2)$ is thus fed directly into $\mathfrak{A}$ and is said to be
accepted if and when $\mathfrak{A}$ gets off one of the tapes in a
designated final state of $\mathfrak{A}$.

Let $s''$ be in $C_0$ and $s'$ be in $C_1$. A tape $t$ on the alphabet
$\Sigma$ is called an $(s', s'')$ transition tape if $\mathfrak{A}$, when
started on $t$ in $s'$, will go through states in $C_1$ until it gets off
$t$ in $s''$. For every pair $(s', s'')$ for which there exists some
transition tape let $t(s', s'')$ denote a shortest one. The
length of $t(s', s'')$ is clearly less than the number of
states in $C_1$ so that all shortest transition tapes, and
hence all pairs of states possessing a transition tape, can
be effectively found.

A state $s'$ in $C_1$ will be called a finalizing state if there
exists a tape $t(s')$ such that $\mathfrak{A}$, when started on $t(s')$
in $s'$, will go in states of $C_1$ to the end of $t(s')$ and get off
the tape in a designated final state of $\mathfrak{A}$.

Define now a nondeterministic one-tape automaton $\mathfrak{B}$
as follows. Let $f$ be some new element not in $S$; the set
of states of $\mathfrak{B}$ is $C_0 \cup \{f\}$. The table $N$ of moves
of $\mathfrak{B}$ is defined by

(i) $N(f, \sigma) = \{f\}$;
(ii) $N(s, \sigma) = \{M(s, \sigma)\}$, if $M(s, \sigma)$ is in $C_0$;
(iii) $N(s, \sigma) = \{f\}$, if $M(s, \sigma)$ is a finalizing state in
$C_1$;
(iv) $N(s, \sigma) =$ the set of all $s''$ where there exists a
transition tape $t(M(s, \sigma), s'')$, otherwise.

The set of initial states of $\mathfrak{B}$ is $\{s_0\}$ if $s_0$ is in
$C_0$ and is the set of $s''$ for which there exists a tape
$t(s_0, s'')$, if $s_0$ is in $C_1$. The set of designated final states
of $\mathfrak{B}$ is $(C_0 \cap F) \cup \{f\}$.

It is left for the reader to verify that the set of all
tapes accepted by $\mathfrak{B}$ is precisely the domain of the
relation $T_2(\mathfrak{A})$; we recall at this point the simplified
definition of acceptance used in the proof. This completes our
proof.

Corollary 16.1. There are effective procedures whereby,
given a two-tape automaton $\mathfrak{A}$, it can be decided in
a finite number of steps whether $T_2(\mathfrak{A})$ is empty and
whether $T_2(\mathfrak{A})$ is infinite.

Proof: Construct the one-tape automata $\mathfrak{B}$ and
$\mathfrak{C}$ defining the domain and range of the relation
$T_2(\mathfrak{A})$. The set $T_2(\mathfrak{A})$ is empty if and only if
$T(\mathfrak{B})$ is empty. The set $T_2(\mathfrak{A})$ is infinite if and
only if at least one of $T(\mathfrak{B})$ and $T(\mathfrak{C})$ is
infinite. Now apply Corollaries 7.1 and 9.1.

Corollary 16.2. If $T_2(\mathfrak{A})$ contains only pairs of the
form $(t, t)$ (i.e., $\mathfrak{A}$ defines a diagonal relation) then the
set of all tapes $t$ for which $(t, t)$ is in $T_2(\mathfrak{A})$ is
definable by a one-tape automaton.


10. Impossibility of Boolean operations


Whereas the class of sets definable by one-tape automata 
is closed under the Boolean operations (Theorem 5), 
when we come to sets of pairs definable by two-tape au- 
tomata the situation is markedly different. 

Theorem 17. The class of all sets definable by two-tape
automata (i) is closed under complementation;
(ii) is not closed under intersection and union.*

Proof: (i) Let $\mathfrak{A} = (S, M, s_0, F, C_0, C_1)$. The
complement, with respect to the set of all pairs of tapes on $\Sigma$,
of $T_2(\mathfrak{A})$, is $T_2((S, M, s_0, S - F, C_0, C_1))$. (ii) Let
$\Sigma = \{0, 1\}$ and use the notation $0^n$ to denote the tape
containing $n$ zeroes. The sets
$U = \{(0^n 1 0^m, 0^k 1 0^n),\ n, m, k = 1, 2, \ldots\}$
and $V = \{(t, t),\ t \text{ runs through all tapes}\}$ are definable by
two-tape automata. Now
$U \cap V = \{(0^n 1 0^n, 0^n 1 0^n),\ n = 1, 2, \ldots\}$. If this set
were definable by a two-tape automaton then, by Corollary 16.2, the set
$\{0^n 1 0^n,\ n = 1, 2, \ldots\}$ would be definable by a one-tape
automaton, which is impossible. That the class of definable sets
is not closed with respect to unions now follows from the
identity $A \cap B = T - [(T - A) \cup (T - B)]$ and (i).


11. Unsolvability of the intersection problem


We have shown that the emptiness problem for two-tape 
automata is effectively solvable. It will now turn out that 
a similar elementary problem is not solvable. As a prepa- 
ration for this result concerning automata we must recall 
a theorem of E. Post.® 

The correspondence problem is the following: Given
two equally long ordered lists $a_1, a_2, \ldots, a_n$ and
$b_1, b_2, \ldots, b_n$ of tapes on the alphabet $\Sigma$, to decide
whether there exists a sequence of indices $i_1, i_2, \ldots, i_k$, where
$1 \le i_j \le n$, such that

$$a_{i_1} a_{i_2} \cdots a_{i_k} = b_{i_1} b_{i_2} \cdots b_{i_k}.$$

E. Post proved that the correspondence problem (for an
alphabet with more than one letter) is not effectively
solvable.


*J. C. Shepherdson informed us in a letter about a different simple
example for the fact that the class of definable relations is not closed
under intersections.


Theorem 18. The problem whether for two finite two-tape
automata $\mathfrak{A}_1$ and $\mathfrak{A}_2$ we have
$T_2(\mathfrak{A}_1) \cap T_2(\mathfrak{A}_2) = \emptyset$
(the empty set) is not effectively solvable.

Proof: Corresponding to every sequence $a_1, a_2, \ldots, a_n$
of words on our alphabet $\Sigma$ construct a set
$P(a_1, a_2, \ldots, a_n)$ of pairs of tapes as follows: We may assume
that 0, 1 are in $\Sigma$; if $i$ is an integer let $\bar{i}$ be the tape
consisting of $i$ symbols 1 followed by a single 0. Now $(t_1, t_2)$ is in
$P(a_1, a_2, \ldots, a_n)$ if and only if for some $k$

(i) $t_1 = a_{i_1} a_{i_2} \cdots a_{i_k}$,
(ii) $t_2 = \bar{i}_1 \bar{i}_2 \cdots \bar{i}_k$,

where $i_j \le n$.

It is not hard to construct a two-tape automaton
$\mathfrak{A}(a_1, a_2, \ldots, a_n) = \mathfrak{A}$ such that
$T_2(\mathfrak{A}) = P(a_1, a_2, \ldots, a_n)$.
Namely, to check whether a pair $(t_1, t_2)$ satisfies conditions
(i) and (ii), $\mathfrak{A}$ will start on $t_2$ and count the number
of symbols 1 until the first 0 is met; let this number
be $i_1$. The machine then switches to $t_1$ and checks
whether this tape begins with $a_{i_1}$; if it does not, then
$(t_1, t_2)$ is not accepted. If $t_1$ does begin with $a_{i_1}$, then
after reading through $a_{i_1}$ the machine switches back to
$t_2$, and the whole process is repeated. If at any time a
symbol other than 0 or 1 is found on $t_2$, or if $t_2$ contains
a run of more than $n$ symbols 1 or more than one symbol
0, then the pair $(t_1, t_2)$ is not accepted. These remarks
sufficiently indicate the construction of $\mathfrak{A}$ and we shall
not go into further detail.
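The check performed by the machine described above can be sketched as ordinary Python. This is not a two-tape automaton, only an illustration of the membership test for $P(a_1, \ldots, a_n)$; it assumes that $k \ge 1$ and that every index satisfies $1 \le i_j \le n$, and the function name `in_P` and the sample lists are hypothetical.

```python
def in_P(words, t1, t2):
    """Test whether (t1, t2) is in P(a_1, ..., a_n), where words[i-1] is a_i."""
    n = len(words)
    i1 = i2 = 0   # read positions on t1 and t2
    blocks = 0    # number of index blocks decoded from t2 so far
    while i2 < len(t2):
        count = 0
        while i2 < len(t2) and t2[i2] == "1":  # count a run of 1s
            count += 1
            i2 += 1
        # The run must have length between 1 and n and be closed by a 0.
        if i2 == len(t2) or t2[i2] != "0" or not (1 <= count <= n):
            return False
        i2 += 1
        word = words[count - 1]
        if not t1.startswith(word, i1):  # t1 must continue with a_i
            return False
        i1 += len(word)
        blocks += 1
    return blocks >= 1 and i1 == len(t1)

# A small instance of the correspondence problem (letters a, b are used
# for the words; the index tape uses 0 and 1). Indices 3, 2, 3, 1 solve it.
A_LIST = ["a", "ab", "bba"]
B_LIST = ["baa", "aa", "bb"]
INDEX_TAPE = "1110" + "110" + "1110" + "10"
print(in_P(A_LIST, "bbaabbbaa", INDEX_TAPE), in_P(B_LIST, "bbaabbbaa", INDEX_TAPE))
```

A pair lying in both sets, as in this instance, is exactly a solution of the correspondence problem for the two lists.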


Given two sequences of words $S_1 = (a_1, a_2, \ldots, a_n)$
and $S_2 = (b_1, b_2, \ldots, b_n)$, then
$P(a_1, a_2, \ldots, a_n) \cap P(b_1, b_2, \ldots, b_n) \ne \emptyset$
if and only if the Post correspondence problem of $S_1$ and $S_2$ has a
solution. Since the correspondence problem is not effectively solvable
it follows that the problem whether

$$T_2(\mathfrak{A}(a_1, \ldots, a_n)) \cap T_2(\mathfrak{A}(b_1, \ldots, b_n)) \ne \emptyset$$

is not effectively solvable.


12. Two-way, two-tape automata


Turning now to two-way, two-tape automata we find 
that all hope of any constructive decision processes is 
lost. It is even impossible to decide, by a constructive 
decision method applicable to all automata, whether a 
two-way, two-tape machine accepts any tapes. To prove 
this formally it is, of course, necessary to give the explicit 
definition of a two-way machine. We shall not give the 
details here, since they are long and not very much different
from the formal definitions needed for two-way,
one-tape automata. The main point is that, as with the
two-way, one-tape automaton, the table of moves of a 
two-way, two-tape automaton sometimes requires the 
machine to back up from the scanned square. However, 
an outline of the proof should clarify the method. 

It was shown above that there is no constructive decision
method for deciding whether two two-tape, one-way
machines $\mathfrak{A}_1$ and $\mathfrak{A}_2$ both accept a common pair
of tapes, that is, whether
$T_2(\mathfrak{A}_1) \cap T_2(\mathfrak{A}_2) \ne \emptyset$. From the
construction of the two-tape machines it follows that if
$h$ is a new symbol not in the alphabet $\Sigma$, then there is a
one-one correspondence between all two-tape, one-way
machines $\mathfrak{A}$ over $\Sigma$ and certain two-tape one-way
machines $\mathfrak{A}'$ over $\Sigma \cup \{h\}$ such that a pair of
tapes $(t_1, t_2)$ is in $T_2(\mathfrak{A})$ if and only if
$(h t_1 h, h t_2 h)$ is in $T_2(\mathfrak{A}')$. In
words, we simply put a marker at the ends of the tapes,
and all accepted tapes must be of this form. Let now $\mathfrak{A}_1$
and $\mathfrak{A}_2$ be any two two-tape machines. The corresponding
machines over $\Sigma \cup \{h\}$ are $\mathfrak{A}'_1$ and
$\mathfrak{A}'_2$. Now because $\mathfrak{A}'_1$ and $\mathfrak{A}'_2$
only accept tapes with markers at the ends, they can be glued together
into a two-way machine $\mathfrak{B}$ such that
$T_2(\mathfrak{B}) = T_2(\mathfrak{A}'_1) \cap T_2(\mathfrak{A}'_2)$.
The two-way motion of $\mathfrak{B}$ is obvious: first run through
the tapes in the style of $\mathfrak{A}'_1$ to see if the pair is
accepted, and then, after hitting the markers at the right end, run
backwards until the left markers are hit, at which time
the motion is again reversed, and the machine is started
over, running in the style of $\mathfrak{A}'_2$.

The outline of the construction given above shows
that every intersection problem about one-way machines
$\mathfrak{A}_1$ and $\mathfrak{A}_2$ is equivalent to the intersection
problem about machines $\mathfrak{A}'_1$ and $\mathfrak{A}'_2$, which in
turn is equivalent to the emptiness problem for a two-way machine
$\mathfrak{B}$. Since there is an effective method for showing these
equivalences, and since there is no effective solution of the
intersection problem for one-way machines, we have proved the
following.

Theorem 19. There is no effective method of deciding
whether the set of tapes definable by a two-tape, two-way
automaton is empty or not.

An argument similar to the above one will show that 
the class of sets of pairs of tapes definable by two-way, 
two-tape automata is closed under Boolean operations. 
In view of Theorem 17, this implies that there are sets 
definable by two-way automata which are not definable 
by any one-way automaton; thus no analogue to Theorem 15 holds.